biomedical
biomedical copied to clipboard
Consider enforcing canonical train/dev/test splits for bigbio schema
Datasets with k-fold definitions (e.g., GAD) are currently cumbersome to use. Maybe consider always enforcing train/dev/test splits, similar to what BLURB did for HoC and BIOSSES. source
schema could preserve folds for compatibilities sake.