text_classification

intermediate data files

Open: rbaral opened this issue 5 years ago • 1 comment

First of all, thanks for your effort to make this repo interesting. I ran the preprocessing notebook and was able to generate some of the files; however, the other scripts use a lot of data files that are not easily accessible. I tried many times to get a Baidu storage account but couldn't, because I have an overseas phone number. Could you share the script that generates the data files used in your scripts?

rbaral, Jul 12 '19 13:07

You can download the dataset from the contest website:

https://biendata.com/competition/zhihu/

Here is a Dropbox link; I don't know how long it will keep working:

https://www.dropbox.com/s/3sk2yojptodkmb2/ieee_zhihu_cup.rar?dl=0

Sylv-Lej, Jul 24 '19 11:07