KuaiRec
duplicate records in the big matrix
Hi,
Thanks for the fantastic effort of collecting this dataset. However, I found duplicate <user_id, item_id, time> records in the big matrix (e.g., user_id: 217, item_id: 3136, time: '2020-09-01 11:27:43.599'). Looking only at <user_id, item_id> pairs, a single user-video pair has as many as 2224 records. Will future versions deal with this duplicate record issue?
Thanks!
@Alice1998 Thank you for your valuable feedback! I wasn't aware of this issue until you mentioned it. After careful thought, I've decided to keep the current version as it is, since many researchers have already downloaded it and I want to avoid creating discrepancies between copies. However, your suggestion is excellent! We will address this issue when we introduce additional features to the dataset. Your contributions are greatly appreciated! ^_^
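For anyone who needs a workaround in the meantime, here is a minimal sketch of how the duplicates described above could be counted and dropped on the user's side. It is not part of the dataset's official tooling. The column names `user_id`, `item_id`, and `time` follow the wording of this issue and are assumptions (the actual CSV header may differ), and the inline `SAMPLE_CSV` is toy data standing in for the real big matrix file.

```python
import csv
import io
from collections import Counter

# Toy stand-in for the interaction file. Column names are assumptions
# taken from the issue text; check them against the real CSV header.
SAMPLE_CSV = """user_id,item_id,time
217,3136,2020-09-01 11:27:43.599
217,3136,2020-09-01 11:27:43.599
217,3136,2020-09-02 08:00:00.000
5,10,2020-09-01 09:00:00.000
"""


def count_keys(rows, keys):
    """Count how many rows share each combination of the given key columns."""
    return Counter(tuple(row[k] for k in keys) for row in rows)


def drop_duplicates(rows, keys):
    """Keep only the first row seen for each key combination, preserving order."""
    seen = set()
    kept = []
    for row in rows:
        key = tuple(row[k] for k in keys)
        if key not in seen:
            seen.add(key)
            kept.append(row)
    return kept


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))

# Exact duplicates: same user, same item, same timestamp.
triple_counts = count_keys(rows, ["user_id", "item_id", "time"])
extra_rows = sum(count - 1 for count in triple_counts.values())

# Repeated views: same user and item, any timestamp.
pair_counts = count_keys(rows, ["user_id", "item_id"])
max_per_pair = max(pair_counts.values())

deduped = drop_duplicates(rows, ["user_id", "item_id", "time"])

print(extra_rows, max_per_pair, len(deduped))  # prints: 1 3 3
```

Note that the two checks answer different questions: rows that repeat the full <user_id, item_id, time> triple are exact duplicates, while multiple rows for the same <user_id, item_id> pair at different times may be genuine repeat views, so whether to collapse those depends on the task.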