M3Exam
M3Exam copied to clipboard
Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"
这个是阿里自己做的数据集,还是公共认可额数据集?You can download the data from [here](https://cutt.ly/m3exam-data).
Hi, Thanks for providing this work, I found I can't open this data download [link](https://cutt.ly/m3exam-data), could you tell me wgat should I do?
Dear authors, Thanks for your open sourcing! It seems that the correct image placeholder would be `(image)[image-x.png]` or `(image)[image-x.jpg]`. However, I find a lot of inconsistent image placeholders in the...
求详细的实验结果
请问Multimodal Evaluation实验的结果,中文数据集上各个模型的表现,能提供下具体的指标吗。
Would it be possible to release the results of your evaluations in some machine readable format (json, csv, txt etc.)? For example your results from the tables in your paper...