test
test copied to clipboard
Measuring Massive Multitask Language Understanding | ICLR 2021
Results
14
test issues
Sort by
recently updated
recently updated
newest added
![image](https://github.com/hendrycks/test/assets/8592144/d9fc1b18-c126-4f1b-8bc7-6d6b9567434b)
Please allow me to consult with you regarding the use of the term "catheter" in clinical_knowledge.csv. ``` Question 80: Which of the following would not be done before catheterizing? A:...
Hello, my name is hiroya iizuka, 12 years experience cardiology doctor. I found typo in clinical_knowledge_test.csv (line 66) In hypovolaemic shock -> In hypovolemic shock Please fix this typo.
This is the top link for the MMLU Benchmark