ir_datasets icon indicating copy to clipboard operation
ir_datasets copied to clipboard

TREC CAsT 2022

Open seanmacavaney opened this issue 3 years ago • 0 comments

Dataset Information:

"The main aim of Conversational Assistance Track (CAsT) is to advance research on conversational search systems. The goal of the track is to create reusable benchmarks for open-domain information centric conversational dialogues."

Links to Resources:

<links including data websites, repositories, papers, etc. that would help in adding the dataset.>

Dataset ID(s) & supported entities:

  • TBD, but likely under trec-cast/

Checklist

Mark each task once completed. All should be checked prior to merging a new dataset.

  • [ ] Dataset definition (in ir_datasets/datasets/[topid].py)
  • [ ] Tests (in tests/integration/[topid].py)
  • [ ] Metadata generated (using ir_datasets generate_metadata command, should appear in ir_datasets/etc/metadata.json)
  • [ ] Documentation (in ir_datasets/etc/[topid].yaml)
    • [ ] Documentation generated in https://github.com/seanmacavaney/ir-datasets.com/
  • [ ] Downloadable content (in ir_datasets/etc/downloads.json)
    • [ ] Download verification action (in .github/workflows/verify_downloads.yml). Only one needed per topid.
    • [ ] Any small public files from NIST (or other potentially troublesome files) mirrored in https://github.com/seanmacavaney/irds-mirror/. Mirrored status properly reflected in downloads.json.

Additional comments/concerns/ideas/etc.

Little public information available yet.

RE: #6

seanmacavaney avatar Feb 25 '22 22:02 seanmacavaney