uptrain
uptrain copied to clipboard
Add an operator to compute MMLU score
MMLU (Massive Multitask Language Understanding) is a benchmark designed to measure knowledge acquired during pretraining by evaluating models exclusively in zero-shot and few-shot settings.