agent-lightning
How to run agentic RL with Agent Lightning in a multi-node, multi-GPU distributed training environment?
How can I run agentic RL with Agent Lightning in a distributed training environment with multiple nodes and multiple GPUs (e.g., 6 machines with 8 GPUs each)?
Could you give an example of the training config? Thanks.
Launching should be similar to verl. Please refer to the verl multi-node example.
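For what it's worth, here is a rough, untested sketch of what the 6 x 8 case from the question might look like if the launch really does mirror verl. It assumes a Ray cluster is already running across all machines (in verl this is typically done with `ray start --head` on one node and `ray start --address=...` on the others), that the job is submitted once from the head node, and that the `agentlightning.verl` entry point accepts verl's `trainer.nnodes` and `trainer.n_gpus_per_node` overrides. The helper names `build_launch_command` and `total_gpus` are made up for illustration, and the data, model, and rollout overrides a real run needs are omitted.

```python
import shlex
from typing import List


def build_launch_command(
    nnodes: int,
    gpus_per_node: int,
    entry_module: str = "agentlightning.verl",  # assumption: verl-style entry point
) -> List[str]:
    """Build a verl-style launch command with the cluster-size overrides only."""
    if nnodes < 1 or gpus_per_node < 1:
        raise ValueError("nnodes and gpus_per_node must both be >= 1")
    return [
        "python",
        "-m",
        entry_module,
        # verl trainer config keys that describe the cluster shape
        f"trainer.nnodes={nnodes}",
        f"trainer.n_gpus_per_node={gpus_per_node}",
    ]


def total_gpus(nnodes: int, gpus_per_node: int) -> int:
    """Total number of GPUs the trainer will expect to find in the Ray cluster."""
    return nnodes * gpus_per_node


if __name__ == "__main__":
    # 6 machines with 8 GPUs each, as in the question
    cmd = build_launch_command(nnodes=6, gpus_per_node=8)
    print(shlex.join(cmd))
    print("GPUs expected in the Ray cluster:", total_gpus(6, 8))
```

The printed command would be run on the head node only, with the remaining overrides from a single-node training script appended to it. Whether this works unchanged on multiple nodes has not been confirmed in this thread.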
We will need to find a multi-node, multi-GPU environment to test this.
Did you get it running successfully on multiple nodes? @wendongbi
We might need an example for this.