agent-lightning icon indicating copy to clipboard operation
agent-lightning copied to clipboard

Agent Lightning Backlog Tracker

Open lunaqiu opened this issue 4 months ago β€’ 2 comments

This issue serves as a central backlog & roadmap tracker for Agent Lightning.

Our core members come from a research and engineering team at MSRA. As our team is still forming, we're currently severely understaffed and welcome new contributors. We encourage all Agent Lightning users to share your thoughts on backlog itemsβ€”please comment on which issues interest you most, including your priorities, preferences, and any suggestions you might have.

Emoji Status Description
πŸ’‘ Idea/Discovery Needs more investigation or discussion.
πŸ“‹ Ready to Start Scoped, prioritized, and ready for development .
πŸƒβ€β™‚οΈ In progress Someone is actively working on this.
πŸ‘€ In review/Testing/QA Awaiting code review or in the qa and testing phase.
βœ‹ Blocked Halted, waiting for an answer or a dependency.
βœ… Done The task is complete.
❌ Won't Do This task has been cancelled.

Core Stability

  • βœ…P0 - Bugfix for #37 #63 @hzy46
  • πŸ’‘P0 - Bugfix for tracer can't get data from requests.post
  • πŸ‘€P0 - Record rollout level reward before dropping trajectories @hzy46

Documentation and Examples

  • πŸ‘€P0 - A framework-less example with Search-R1 #64 @SiyunZhao @hzy46 #147
  • πŸƒβ€β™‚οΈP0 - Debug tutorial on the way
  • πŸƒβ€β™‚οΈP1 - A tool selection example #65 @XufangLuo

New Features

-πŸ’‘P2 - Customizable triplet

Algorithms

  • πŸƒβ€β™‚οΈP0 - Credit Assignment #31

Observability

  • πŸƒβ€β™‚οΈP0 - Sending traces to AgentOps #43 @mydmdm

lunaqiu avatar Aug 25 '25 10:08 lunaqiu

Backlog v0.3 - what's on my mind:

  • Tinker support
  • Azure OpenAI SFT support (cloud-sft branch)
  • SqliteLightningStore
  • Hao's improvement on tracer
  • Online RL example
  • VERL 0.6 and vllm 0.11 support
  • Customizing AgentModeDaemon (probably needs refactor there)
  • Switch to uv for dependency management
  • Multi-modality example
  • Merge Unsloth SFT trainer into algorithm zoo, and compare APO, VERL and SFT on calc-x
  • Unify helper for: async in sync in async. unicorn server start.
  • Support multi-prompts auto optimization.
  • Collect human feedbacks within algorithms.

ultmaster avatar Oct 17 '25 15:10 ultmaster

Support for VERL 0.6 would be great indeed!

xavier-owkin avatar Oct 23 '25 09:10 xavier-owkin