Flexible-Fairness-Constraints

Excuse me, I am trying to reproduce the results in this paper but cannot find the Reddit dataset in this repo. Could you tell me where I can download it?

Open guanchuwang opened this issue 4 years ago • 3 comments

guanchuwang, Jan 11 '21 10:01

Hi, the Reddit dataset is too large to host here, but it comes from the 2017-11 Reddit dump collected by Jason Baumgartner: https://www.reddit.com/r/pushshift/comments/bcxguf/new_to_pushshift_read_this_faq/

joeybose, Jan 12 '21 05:01

Which file from this site did you use as the dataset? Is it RS_2017-11.xz from https://files.pushshift.io/reddit/submissions/? Please let me know; I would really appreciate it, and I can download it myself.

guanchuwang, Jan 12 '21 05:01

Hey, I believe RS_2017-11.xz is the correct one. It's been a few years, so I'm not 100% sure, as I only kept the processed versions of the dataset to save space. Some of the scripts in the repo should be able to process this for you as well.

joeybose, Jan 12 '21 07:01
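
For anyone who wants to inspect RS_2017-11.xz before running the repo's scripts, here is a minimal sketch of streaming it without decompressing to disk. It assumes the dump is xz-compressed, newline-delimited JSON with one submission per line, and that each record has `author` and `subreddit` fields; both are assumptions about the file's layout, not something confirmed in this thread. The helper names `iter_submissions` and `count_author_subreddit_pairs` are hypothetical, and the counting step is only an illustration, not the preprocessing used in the repo.

```python
import json
import lzma
from collections import Counter
from typing import Dict, Iterator


def iter_submissions(path: str) -> Iterator[Dict]:
    """Yield one dict per non-empty line of an xz-compressed JSON-lines file."""
    # lzma.open in text mode decompresses on the fly, so the file is never
    # fully loaded into memory.
    with lzma.open(path, mode="rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)


def count_author_subreddit_pairs(path: str) -> Counter:
    """Count (author, subreddit) pairs, skipping missing or deleted authors."""
    counts: Counter = Counter()
    for post in iter_submissions(path):
        # Field names are assumed; adjust them if the dump uses different keys.
        author = post.get("author")
        subreddit = post.get("subreddit")
        if not author or not subreddit or author == "[deleted]":
            continue
        counts[(author, subreddit)] += 1
    return counts
```

Calling `count_author_subreddit_pairs("RS_2017-11.xz").most_common(10)` would then list the most frequent pairs, which is a quick way to check that the download is the file you expect.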