df-dn-paper
df-dn-paper copied to clipboard
Consider saving raw predictions in benchmarks
Saving raw predictions of classification tasks helps improving evaluation metrics (changing/adding/deleting/...). As the test sets are randomly generated, test labels need to be saved at the same time.