evaluation
evaluation
copied to clipboard
Published
20 hours ago
•
bigscience-workshop
Reame
Issues
Add QASPER to Full Benchmark
Open
epavlick
opened this issue 2 years ago
• 2 comments
use to test generalization to unseen domain; maybe use FLEX?
Aug 10 '21 14:08
epavlick