Wendong Bi

Results: 1 issue by Wendong Bi

How to run agentic RL with Agent Lightning in a distributed multi-node, multi-GPU training environment (e.g., 6 machines with 8 GPUs each)

help wanted
question
examples
verl