agent-lightning icon indicating copy to clipboard operation
agent-lightning copied to clipboard

How to run agentic rl with agent lightning in the distributed training environment with multi-nodes and multi-gpus ?

Open wendongbi opened this issue 3 months ago • 4 comments

How to run agentic rl with agent lightning in the distributed training environment with multi-nodes and multi-gpus (e.g., 6 machines and 8 gpus for each machine)

wendongbi avatar Sep 22 '25 15:09 wendongbi

Can you give an example of the training config. thanks.

wendongbi avatar Sep 22 '25 15:09 wendongbi

The launching should be similar to verl. Please refer to the verl example.

We will need to find an environment with multi nodes and multi gpus to test this.

matluster avatar Sep 23 '25 00:09 matluster

Do you run successfully with multi nodes? @wendongbi

hzy312 avatar Oct 31 '25 03:10 hzy312

We might need an example for this.

ultmaster avatar Nov 29 '25 16:11 ultmaster