LLM-Agent-Paper-List
LLM-Agent-Paper-List copied to clipboard
Consider adding AppWorld to the list
Thanks for a great survey and for setting up this repository!
I know the papers listed here from Zhiheng Xi et al. But if you are open to adding newer papers to the list, I would like to add AppWorld.
🔗 Website: https://appworld.dev/ 📄 Paper: https://arxiv.org/abs/2407.18901 🐦 Tweet: https://x.com/harsh3vedi/status/1818311843976233198 💬 Blog: https://appworld.dev/blog 🎬 Video(s): https://appworld.dev/video 🌎 Code: https://github.com/stonybrooknlp/appworld 🧭 Data (task, trajectories) explorer, playground: https://appworld.dev/task-explorer 🔍 API explorer: https://appworld.dev/api-explorer 📊 Leaderboard: https://appworld.dev/leaderboard
TLDR: Introduces AppWorld Engine, a high-fidelity execution environment of 9 day-to-day apps, operable via 457 APIs, populated with digital activities of 106 people living in a simulated world, and an associated benchmark of natural, diverse, and challenging autonomous agent tasks requiring rich and interactive coding.
In my opinion, AppWorld fits in the following (sub)sections.
- 4.1 Benchmarks for LLM-based Agents
- 1.3.1 Tool Using
- 3.2.1 Text-based Environment
- 2.1.1 Task-oriented Deployment (Web scenarios)
- 3.2.2 Virtual Sandbox Environment
- 1.1.5 Transferability and Generalization