massive
About the official evaluation script
Is there an evaluation script that can directly compare a prediction file against the gold prediction file, i.e., the official evaluation script?
Hi @bozheng-hit, we recommend using eval.ai for official test results. We will be opening submissions to the MMNLU-22 phases soon.
Is it possible to return results for all languages separately?