agential
agential copied to clipboard
[Feature Request]: ALFWorld for CRITIC
Feature Description
Familiarize yourself with the repository and take a look at the CRITIC repo, paper, and ALFWorld.
Currently, the CRITIC implementation only has prompts for HotpotQA and TriviaQA.
Add ALFWorld prompts and relevant logic to the current CRITIC implementation. You'll see that an agent's current structure is divided into cog/agent
, cog/modules
, and cog/functional/
. This task will require you to modify cog/prompts
but also test your code in all the other relevant modules cog/functional
and cog/agent
. CRITIC does not have any cog/modules
.
What to submit:
- [ ] Set up your environment via the CONTRIBUTING.md
- [ ] Make a Pull Request (PR)
- [ ] Add the prompts for ALFWorld
- [ ] Write a short notebook
tmp.ipynb
incog/agent
showcasing the agent ran on a sample question from the benchmark- [ ] Add print statements for all calls to the LLM for easier debugging + I can easily verify the outputs
- [ ] Include a thorough description of your changes within the PR
- [ ] Request a review from @alckasoc
CRITIC was not tested on ALFWorld. To test CRITIC on ALFWorld, reference how other method papers test on ALFWorld. Check the project lifecycle document. If there is any additional logic for testing CRITIC on ALFWorld, include these specifications in the PR description.
Feel free to ask me questions on Slack if you're confused! Good luck!