Code and Data for Evaluation WG

Results: 50 evaluation issues

related: #79 @jaketae: here you go :) (_I have more free time than usual until next Monday, so please feel free to assign me other tasks._)

use to test generalization to unseen labels; maybe use FLEX?

few_shot

- Evaluated on: GPT Neo. An error occurs if you try to evaluate gpt-neo related models (e.g. "EleutherAI/gpt-neo-125M") using the original version of transformers. This is due...