evaluation Add MNLI to Full Benchmark

Add MNLI to Full Benchmark

Open epavlick opened this issue 4 years ago • 3 comments

coordinate with whoever is working on SuperGLUE, we only need to include MNLI once. But NLI will be held-out from model training (whereas the other SuperGLUE tasks will not) so interpreting MNLI results is different from other superglue tasks.

use to test generalization to unseen task; maybe use FLEX?

Aug 10 '21 14:08 epavlick

I can do it :)

Aug 10 '21 15:08 PierreColombo

@PierreColombo I'd love to help contribute to this one if you need any help!

Aug 10 '21 17:08 wilsonyhlee

MNLI in PS here

Apr 25 '22 17:04 manandey

evaluation evaluation copied to clipboard

Add MNLI to Full Benchmark

evaluation
evaluation copied to clipboard