agent-lightning

The absolute trainer to light up AI agents.

Results: 140 agent-lightning issues

Each step will conduct rollout, so I guess the training is on-policy

question
verl

I have read your paper and code; the work is beautiful and practical. My question is as follows. Taking the SQL Agent as an example: if, in a multi-agent system,...

question
verl

In a fixed workflow with multiple roles (each defined by a distinct system prompt), AgentLightning models each role’s I/O as transitions and may group them. Without role-level rewards, is training...

question
verl
credit assignment

When I was training a model on the local server, the server crashed and could no longer be reached; the logs showed a Ray OOM error, among others.

need investigation
verl
out-of-memory

Agent rollout takes a long time. Can we create multiple actor instances to accelerate rollout? Currently I have to set TP = [number of GPUs] to use all GPUs during...

help wanted
question
verl

How can I run agentic RL with Agent Lightning in a distributed, multi-node, multi-GPU training environment (e.g., 6 machines with 8 GPUs each)?

help wanted
question
examples
verl

In my RAG agent’s “think → search → think…” chain of operations, I sometimes edit the tokens generated by the LLM (e.g., deleting a few tokens, reordering them, etc.) before...

question
tracer

Is it possible to accelerate rollout by taking advantage of all the GPUs?

waiting for reply
verl