[Feature] Adding Large Reasoning Models Results

Open Aaron617 opened this issue 10 months ago • 0 comments

Hi AgentBench Team,

Thanks for your awesome effort in constructing this benchmark. I would like to ask whether you have added, or plan to add, experimental results for large reasoning models such as deepseek-r1 and o3-mini on AgentBench.

Best, Mengkang

Feb 18 '25 08:02 Aaron617