FUJIsyu0515

Results 3 comments of FUJIsyu0515

@yuhangzang @cool-xuan The methods you provided are very useful for avoiding OOM at startup. I have tried them. However, now it always suddenly appears OOM after running dozens of steps....

@czczup 请问这两个数据集最近有计划传完吗?

just use open-rlhf or verl to train with DAPO or GSPO, it's better to use frameworks that supporting distributed training for rl.