
Consider enforcing canonical train/dev/test splits for bigbio schema

Open jason-fries opened this issue 2 years ago • 0 comments

Datasets with k-fold definitions (e.g., GAD) are currently cumbersome to use. Consider always enforcing train/dev/test splits in the bigbio schema, similar to what BLURB did for HoC and BIOSSES. The source schema could preserve the original folds for compatibility's sake.
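One possible way to derive fixed splits from k-fold data is sketched below. It is only an illustration: the function name `folds_to_splits`, the list-of-folds input layout, and the choice of holding out the last two folds are all assumptions, not part of the bigbio codebase and not necessarily what BLURB did for HoC or BIOSSES.

```python
from typing import Dict, List, Sequence


def folds_to_splits(
    folds: Sequence[Sequence[dict]],
    dev_fold: int = -2,
    test_fold: int = -1,
) -> Dict[str, List[dict]]:
    """Collapse k disjoint held-out folds into fixed train/dev/test splits.

    Hypothetical helper: `folds[i]` is assumed to hold the examples that
    are held out in fold i, so the folds partition the dataset.
    """
    k = len(folds)
    if k < 3:
        raise ValueError("need at least 3 folds to form train/dev/test")

    # Normalize negative indices so -1 means the last fold.
    dev_idx = dev_fold % k
    test_idx = test_fold % k
    if dev_idx == test_idx:
        raise ValueError("dev and test must use different folds")

    splits: Dict[str, List[dict]] = {"train": [], "dev": [], "test": []}
    for i, fold in enumerate(folds):
        if i == test_idx:
            name = "test"
        elif i == dev_idx:
            name = "dev"
        else:
            name = "train"
        splits[name].extend(fold)
    return splits
```

Because the assignment depends only on fold order, the resulting splits are reproducible without a random seed, while the original folds could still be exposed unchanged through the source schema.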

jason-fries • Jun 03 '22 23:06