ir_datasets
ir_datasets copied to clipboard
Provides a common interface to many IR ranking datasets.
**Is your feature request related to a problem? Please describe.** @andrewyates points out these two use cases when working with docs: 1. do something sane for the situation where you...
Allows defining datasets locally
**Dataset Information:** "The Health Misinformation track aims to (1) provide a venue for research on retrieval methods that promote better decision making with search engines, and (2) develop new online...
**Dataset Information:** " The Fair Ranking track focuses on building two-sided systems that offer fair exposure to ranked content producers while ensuring high results quality for ranking consumers. " **Links...
**Dataset Information:** "The goal of the new Clinical Trials track is to focus research on the clinical trials matching problem: given a free text summary of a patient health record,...
**Dataset Information:** "The main aim of Conversational Assistance Track (CAsT) is to advance research on conversational search systems. The goal of the track is to create reusable benchmarks for open-domain...
**Dataset Information:** "The CrisisFACTS track focuses on temporal summarization for first responders in emergency situations. These summaries differ from traditional summarization in that they order information by time and produce...
**Dataset Information:** "The Deep Learning track focuses on IR tasks where a large training set is available, allowing us to compare a variety of retrieval approaches including deep neural networks...
**Dataset Information:** **Links to Resources:** **Dataset ID(s):** **Supported Entities** - [] docs - [] queries - [] qrels - [] scoreddocs - [] docpairs - [qrels_2016.csv](https://github.com/allenai/ir_datasets/files/6888649/qrels_2016.csv) **Additional comments/concerns/ideas/etc.**
TREC CAST
For conversational AI. http://www.treccast.ai/ **Documents:** Uses MS-MARCO, TREC CAR, and Washington Post collections. Also includes a list of duplicate files, due to the combination of collections, it seems. **Queries/qrels:** Queries...