MXNet-for-CDL icon indicating copy to clipboard operation
MXNet-for-CDL copied to clipboard

Can't interpret the text information and ratings matrix imported to NN

Open m2rik opened this issue 8 years ago • 1 comments

https://datascience.stackexchange.com/questions/26653/cant-interpret-the-text-information-and-ratings-matrix-imported-to-nn

m2rik avatar Jan 15 '18 16:01 m2rik

Hi,

For the mult.data file, in 63 1:2 1666:1 132:1 901:1 1537:2 8:1 9:1 912:1

63 is the number of words for this documents, 1:2 means word 1 appears twice in the document, 1666:1 means word 1666 appears once in the document, etc.

For the trainuser.dat, in 10 1631 3591 10272 14851 4662 13172 12684 5324 3595 3404

10 is the number of positive samples for this user, the rest is a list of 10 items that are related to (liked by) this user.

You can check the README file of www.wanghao.in/data/ctrsr_datasets.rar for more details on the datasets.

Best, Hao

js05212 avatar Jan 15 '18 16:01 js05212