feiliya333 comments

Repositories
Issues
Comments

Results 3 comments of


                                            feiliya333

[question] how many GPUs are used by 'single_node' in the default deepchat running script

i mean the default training script for deepchat.

[BUG] deepspeed Chat. OOM when running "single_node" mode because deepspeed is assigning calculating job to a small display card instead of other four A100 gpu cards

the problem has been solved! thanks so much for help!

Model performance suprisingly bad

> Hey @s-isaev no problem, this was trained on the base configurations for the 1.3B model provided on the Github. These are: > > * `training/step1_supervised_finetuning/training_scripts/single_node/run_1.3b.sh` > * `training/step2_reward_model_finetuning/training_scripts/single_node/run_350m.sh` >...