BrowserGym
BrowserGym copied to clipboard
GAIA
- add gaia and gaia eval (based on assistantbench pr - https://github.com/ServiceNow/BrowserGym/pull/186/)
- refactor writing predictions to jsonl to a utils file
- fix assistantbench readme
What is the status of this?
This PR still requires some work before merging. The implementation should be very similar to #186
I won't have time to work on it, is there someone else who would be willing to take it?
Stale