cedr

How to get WebTrack 2012-2014 datasets.

Open haiahaiah opened this issue 4 years ago • 1 comment

Hi, I'm confused about how to get WebTrack 2012-2014 datasets. I would appreciate it if you could provide me with the specific process. Thanks a lot.

haiahaiah Mar 15 '21 13:03

Information on obtaining the two ClueWeb collections is available here:

  • https://lemurproject.org/clueweb09.php/
  • https://lemurproject.org/clueweb12/

They are purchased from CMU and shipped on hard drives. Unfortunately, to my understanding, they cannot be distributed by any other means.

The WebTrack queries and qrels are from TREC, and can be found here: https://trec.nist.gov/data/webmain.html
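Once the qrels are downloaded, reading them could look roughly like the sketch below. It assumes the usual four-column TREC qrels layout (topic, iteration, document ID, relevance); the `parse_qrels` helper and the sample lines are made up for illustration and are not part of this repository or of the TREC files.

```python
from collections import defaultdict


def parse_qrels(lines):
    """Parse TREC-style qrels lines of the form 'topic iteration docid relevance'.

    Hypothetical helper: returns {topic: {docid: relevance}}.
    """
    qrels = defaultdict(dict)
    for line in lines:
        parts = line.split()
        if len(parts) != 4:
            continue  # skip blank or malformed lines
        topic, _iteration, docid, relevance = parts
        qrels[topic][docid] = int(relevance)
    return dict(qrels)


# Made-up sample lines in the assumed format (not real judgments).
sample = [
    "151 0 clueweb09-en0000-00-00000 1",
    "151 0 clueweb09-en0000-00-00001 0",
    "152 0 clueweb09-en0000-00-00002 -2",
]
qrels = parse_qrels(sample)
```

For a real file, the same function can be given an open file handle, e.g. `parse_qrels(open(path))`, where `path` points at whichever qrels file you downloaded from the TREC page.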

Does this help?

seanmacavaney Mar 15 '21 14:03