agential
agential copied to clipboard
[Feature Request]: ReAct
Feature Description
Implement:
- [x] HotpotQA
- [x] #89
- [x] FEVER
- [x] #90
- [x] #91
- [x] #92
- [x] #93
- [x] #94
- [x] #95
- [ ] #96
- [ ] #97
- [ ] #98 (includes ALFWorld & WebShop)
The decision-making benchmarks (ALFWorld, WebShop, and AgentBench) will require more design work. Swapping out the prompts won't suffice.
Run:
- [ ] HotpotQA
- [ ] TriviaQA
- [ ] AmbigNQ
- [ ] GSM8k
- [ ] SVAMP
- [ ] TabMWP
- [ ] MBPP
- [ ] HumanEval
- [ ] ALFWorld
- [ ] WebShop
- [ ] AgentBench (includes ALFWorld & WebShop)