ir_datasets
high recall retrieval datasets
For testing, e.g., total recall systems.
Datasets include RCV1 and JebBush.
The document collections are not publicly available, but we can provide instructions on how to get copies, just like for trec-robust04 and the TREC multilingual datasets.
@eugene-yang is an expert on these. Maybe he can help.
Sure thing!