opencompass
opencompass copied to clipboard
[Feature] is there any way to evaluate an agent?
描述该功能
is there any way to evaluate an agent? btw how to use transbench?
是否希望自己实现该功能?
- [ ] 我希望自己来实现这一功能,并向 OpenCompass 贡献代码!