promptulate

Published 20 hours ago


Benchmark for Agent

Open Undertone0809 opened this issue 4 months ago • 0 comments

🚀 Feature Request

We need a benchmark to evaluate the Agent's abilities.

References

~~- https://github.com/THUDM/AgentBench~~ AgentBench evaluates different LLM models.

https://toolemu.com/
https://mp.weixin.qq.com/s/0FZrgFosHzzYFBRiV3ba2g

Feb 28 '24 15:02 Undertone0809