Wendong Bi
Results
1
issues of
Wendong Bi
How to run agentic rl with agent lightning in the distributed training environment with multi-nodes and multi-gpus (e.g., 6 machines and 8 gpus for each machine)
help wanted
question
examples
verl