lm-evaluation-harness
lm-evaluation-harness copied to clipboard
Mmlu Pro
https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro
Added new task: MMLU-Pro
Improvements:
- Higher quality/difficulty compared to original mmlu
- Multiple-choice with 10 options instead of 4
Closes #1947