wy20907104
Results
1
issues of
wy20907104
hello,i train grpo with nnodes=1, the trian config as flow: ... data.train_batch_size=256 \ actor_rollout_ref.actor.ppo_mini_batch_size=128 \ actor_rollout_ref.actor.ppo_micro_batch_size_per_gpu=8 \ ... the log is: Training Progress: 0%| | 1/540 [09:09