unilm
[MWPBench] AGIEval-Math is actually a part of MATH/Test. Why include both of them?
According to http://arxiv.org/abs/2403.02884, AGIEval-Math is actually a part of MATH/Test.
After reviewing the dataset, we have verified that the MATH test set does indeed include AGIEval-Math. Thanks for pointing this out, @tongyx361!
We will update both the test set and the corresponding results accordingly.
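For anyone who wants to reproduce this kind of containment check, here is a minimal sketch. It is not the script used for the review above: the function names (`normalize`, `overlap_report`) and the toy problem lists are hypothetical, and it assumes that each benchmark has already been loaded as a plain list of problem strings. Matching on lowercased, whitespace-collapsed text is only a heuristic, so near-duplicates with different LaTeX formatting would need a fuzzier comparison.

```python
import re


def normalize(problem: str) -> str:
    """Collapse whitespace and lowercase so trivially different copies compare equal."""
    return re.sub(r"\s+", " ", problem).strip().lower()


def overlap_report(subset_problems, superset_problems):
    """Split subset_problems into those found in superset_problems and those not found."""
    superset_keys = {normalize(p) for p in superset_problems}
    contained = [p for p in subset_problems if normalize(p) in superset_keys]
    missing = [p for p in subset_problems if normalize(p) not in superset_keys]
    return contained, missing


# Toy stand-ins for the two benchmarks (hypothetical data, not real dataset entries).
math_test = [
    "What is $1+1$?",
    "Solve  $x^2 = 4$ for positive $x$.",
    "Compute $3!$.",
]
agieval_math = [
    "what is $1+1$?",
    "Solve $x^2 = 4$ for positive $x$.",
]

contained, missing = overlap_report(agieval_math, math_test)
print(f"{len(contained)} contained, {len(missing)} missing")
```

If `missing` comes back empty on the real data, the smaller set is fully contained in the larger one under this normalization, and evaluating on both would count those problems twice.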