How to run agentic rl with agent lightning in the distributed training environment with multi-nodes and multi-gpus ?

Open wendongbi opened this issue 3 months ago • 4 comments

How to run agentic rl with agent lightning in the distributed training environment with multi-nodes and multi-gpus (e.g., 6 machines and 8 gpus for each machine)

Sep 22 '25 15:09 wendongbi

Can you give an example of the training config. thanks.

Sep 22 '25 15:09 wendongbi

The launching should be similar to verl. Please refer to the verl example.

We will need to find an environment with multi nodes and multi gpus to test this.

Sep 23 '25 00:09 matluster

Do you run successfully with multi nodes? @wendongbi

Oct 31 '25 03:10 hzy312

We might need an example for this.

Nov 29 '25 16:11 ultmaster