KoLA Ambiguity on the evaluation metrics

Ambiguity on the evaluation metrics

Open zhimin-z opened this issue 1 year ago • 6 comments

Are you evaluating F1 or EM (ROUGE or BLEU) after all for these datasets? I have no idea reading this paper. Also, BLEU has a lot of variants, which variant do you use for implementation?

Nov 30 '23 23:11 zhimin-z

KoLA KoLA copied to clipboard

Ambiguity on the evaluation metrics

KoLA
KoLA copied to clipboard