rageval
List all potential test benchmarks
List the datasets most commonly used in RAG research here; we will add them to the benchmarks.
- [ ] THUDM/webglm-qa from huggingface: https://huggingface.co/datasets/THUDM/webglm-qa
- [ ] NaturalQuestions from huggingface: https://huggingface.co/datasets/natural_questions
- [ ] #64
- [ ] Trivia QA from huggingface: https://huggingface.co/datasets/trivia_qa
- [ ] Hotpot QA from huggingface: https://huggingface.co/datasets/hotpot_qa
- [ ] WikiEval from huggingface: https://huggingface.co/datasets/explodinggradients/WikiEval
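
As a starting point for wiring these in, here is a minimal sketch that turns the URLs above into Hugging Face Hub dataset ids and loads one with the `datasets` library. The names `CANDIDATE_URLS`, `hub_id_from_url`, and `load_candidate` are hypothetical and not part of rageval. Some of these datasets may require a configuration name or a specific split, so check each dataset card before loading.

```python
from typing import Optional
from urllib.parse import urlparse

# Candidate benchmarks copied from the checklist above (hypothetical registry).
CANDIDATE_URLS = {
    "WebGLM-QA": "https://huggingface.co/datasets/THUDM/webglm-qa",
    "NaturalQuestions": "https://huggingface.co/datasets/natural_questions",
    "TriviaQA": "https://huggingface.co/datasets/trivia_qa",
    "HotpotQA": "https://huggingface.co/datasets/hotpot_qa",
    "WikiEval": "https://huggingface.co/datasets/explodinggradients/WikiEval",
}


def hub_id_from_url(url: str) -> str:
    """Return the dataset repository id, i.e. the URL path after '/datasets/'."""
    path = urlparse(url).path
    prefix = "/datasets/"
    if not path.startswith(prefix):
        raise ValueError(f"not a Hugging Face dataset URL: {url}")
    return path[len(prefix):].strip("/")


def load_candidate(name: str, config: Optional[str] = None, split: Optional[str] = None):
    """Load one candidate; needs `pip install datasets` and network access.

    `config` and `split` are left to the caller because the right values
    differ per dataset (assumption: consult the dataset card for each).
    """
    # Imported lazily so the registry itself works without the library installed.
    from datasets import load_dataset

    return load_dataset(hub_id_from_url(CANDIDATE_URLS[name]), name=config, split=split)


# Resolve every checklist entry to its Hub id without touching the network.
HUB_IDS = {name: hub_id_from_url(url) for name, url in CANDIDATE_URLS.items()}
```

Keeping the URL-to-id mapping separate from the loading call means the list of candidates can be reviewed and extended offline, and each dataset can be ticked off once `load_candidate` works for it.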