Datasets icon indicating copy to clipboard operation
Datasets copied to clipboard

LSHTC: Category IDs correspond to Wikipedia IDs?

Open shatu opened this issue 7 years ago • 1 comments

Hi, I'm asking this question here since I don't know who else to ask this question about the dataset.

Do the category IDs in the LSHTC dataset (in hierarchy.txt) correspond to the Wikipedia IDs, or are they some kind of internal IDs to the system?

And, if they are internal IDs, is it somehow possible to retrieve the corresponding Wikipedia IDs?

shatu avatar Apr 02 '17 19:04 shatu

It's whatever comes with the downloaded data from Kaggle. I think it's the Wikipedia IDs, but you'll have to verify yourself.

timmolter avatar Apr 03 '17 07:04 timmolter