agent-lightning
agent-lightning copied to clipboard
The absolute trainer to light up AI agents.
Do you support agentic RL? That is, autonomously deciding to call and combine tools based on available tools to complete tasks; I see that the examples using LangChain are all...
curl "http://localhost:44413/v1/chat/completions" \ -H "Content-Type: application/json" \ -d '{ "model": "meta-llama/Llama-3.2-1B", "messages": [ {"role": "user", "content": "what was my last input to you"} ] }' Got output in terminal {"object":"error","message":"As...
**error comes from this directory: examples/calc_x** Processing request of type ListToolsRequest Processing request of type ListToolsRequest Processing request of type ListToolsRequest Processing request of type ListToolsRequest 127.0.0.1 - - [14/Aug/2025...
An error happens when we execute: ```python import os from agentlightning import Trainer, DevTaskLoader, LLM from calc_agent import CalcAgent def dev_task_loader() -> DevTaskLoader: return DevTaskLoader( tasks=[ { "question": "What is...
when i try agent-lightning/examples/rag example python rag_agent.py Traceback (most recent call last): File "/usr/lib/python3.10/multiprocessing/process.py", line 314, in _bootstrap self.run() File "/usr/lib/python3.10/multiprocessing/process.py", line 108, in run self._target(*self._args, **self._kwargs) File "/workspace/verl/agent-lightning/agentlightning/trainer.py", line...
Thank you for your work. Do you provide a web UI to showcase the entire interaction process? For example, can I visualize chatting with different agents and generate a report,...
Calc server that orchestrates end-to-end runs for the Calc‑X example. It starts an AgentLightning server, loads the dataset (prefers examples/calc_x/data/*.parquet, falls back to data.jsonl or a small demo), spawns a...
How can I evaluate the model after the training is complete? Could you please provide some information on the corresponding metrics and evaluation methods? [36m(TaskRunner pid=32107)[0m ("Initial validation metrics: {'val/reward':...
Can you provide a agent example using naive python code without any frameworks like autogen / langchain? I think it is very important for customized need.