feiliya333

Results 3 comments of feiliya333

> Hey @s-isaev no problem, this was trained on the base configurations for the 1.3B model provided on the Github. These are: > > * `training/step1_supervised_finetuning/training_scripts/single_node/run_1.3b.sh` > * `training/step2_reward_model_finetuning/training_scripts/single_node/run_350m.sh` >...