agent-lightning
agent-lightning copied to clipboard
The absolute trainer to light up AI agents.
Each step will conduct rollout, so I guess the training is on-policy
I have read your paper and code, the work is beautiful and practical. My question is as follows: Taking the SQL Agent as an example: if, in a multi-agent system,...
In a fixed workflow with multiple roles (each defined by a distinct system prompt), AgentLightning models each role’s I/O as transitions and may group them. Without role-level rewards, is training...
When I was using the local server for model training, I found that the server crashed and couldn't be connected to, showing ray OOM and other logs.
Agent rollout takes a long time, can we create multiple actor instances to accelerate rollout. Currently I have to set TP = [number of GPUs] to use all gpus during...
How to run agentic rl with agent lightning in the distributed training environment with multi-nodes and multi-gpus (e.g., 6 machines and 8 gpus for each machine)
In my RAG agent’s “think → search → think…” chain of operations, I sometimes edit the tokens generated by the LLM (e.g., deleting a few tokens, reordering them, etc.) before...
Is it possible to accelerate rollout by taking advantage of all the GPU?