wy20907104

Results 1 issues of wy20907104

hello,i train grpo with nnodes=1, the trian config as flow: ... data.train_batch_size=256 \ actor_rollout_ref.actor.ppo_mini_batch_size=128 \ actor_rollout_ref.actor.ppo_micro_batch_size_per_gpu=8 \ ... the log is: Training Progress: 0%| | 1/540 [09:09