agent-lightning issues

Add batch rollout helpers and multi-task queueing

## Summary - allow `training_rollout_batch`/`validation_rollout_batch` APIs to receive per-task resource lists - add `batch_size` to `AgentRunner` and propagate through `Trainer` for batched polling - test batch rollouts with resource lists...

ultmaster

codex

feat: allow programmatic defaults in lightning_cli

## Summary - allow passing defaults for lightning_cli arguments - support programmatic defaults overriding required CLI args - test default overrides for required parameters ## Testing - `pytest` ------ https://chatgpt.com/codex/tasks/task_e_689590e72300832e95482ca7a6b7e4f8

ultmaster

codex

[About Credit Assignment] How to implement an action-dependent reward function?

1

The credit assignment in the sample code applies a uniform policy to all states. However, I'd like to assign rewards differently based on the specific action taken. What would be...

Kwen-Chen

enhancement

question

verl

credit assignment

Support for TS/JS based agent frameworks?

4

I see the current implementation is written in Python, but I’m wondering about framework support. Are there any plans to support TypeScript/JavaScript-based frameworks as well? Or would it be possible...

jordanparker6

help wanted

LLMProxy doesn't support stream

1

We can't get token ids from LLM Proxy when stream is enabled. vLLM has token_ids returned in streaming. The problem is with LiteLLM and has three parts. 1. `.venv/lib/python3.12/site-packages/litellm/litellm_core_utils/streaming_handler.py` has...

ultmaster

blocked by upstream

proxy

feat: add phoenix tracer

20

Add support for Phoenix tracer

Lincyaw

ci-spider

ci-apo

ci-gpu

Refactor LLMProxy to run Uvicorn in isolated process

1

## Summary This PR refactors the `LLMProxy.start()` logic to launch the Uvicorn proxy server in a fully isolated process using `multiprocessing.spawn`. The previous implementation ran the server in a background...

beanie00

i write a custom agent base on qwen-agent, can i use it in my project?

1

doublnt

Wrong Discord Chat link

1

The linked Discord Chat seems to be wrong. The link leads to a Discord chat with a different topic.

Franky1

I dont understand how it works under the hood.

2

Like does it fine tune the model or does it adjust the system prompt or do anything else . like how does the agent learn becaue agent can get intelligence...

hemangjoshi37a

question

agent-lightning
agent-lightning copied to clipboard

Metadata

Add batch rollout helpers and multi-task queueing

feat: allow programmatic defaults in lightning_cli

[About Credit Assignment] How to implement an action-dependent reward function?

Support for TS/JS based agent frameworks?

LLMProxy doesn't support stream

feat: add phoenix tracer

Refactor LLMProxy to run Uvicorn in isolated process

i write a custom agent base on qwen-agent, can i use it in my project?

Wrong Discord Chat link

I dont understand how it works under the hood.

← Metadata

Owner

Metadata

agent-lightning agent-lightning copied to clipboard

Metadata

← Metadata

Owner

Metadata

agent-lightning
agent-lightning copied to clipboard