AgentBench
AgentBench copied to clipboard
[Feature] Adding Large Reasoning Models Results
Hi AgentBench Team,
Thanks for your awesome effort in constructing this benchmark. I would like to ask have you or plan to add the experimental results of large reasoning models like deepseek-r1, o3-mini, etc on AgentBench?
Best, Mengkang