KuaiRec

duplicate records in the big matrix

Open • Alice1998 opened this issue 2 years ago • 1 comment

Hi,

Thanks for the fantastic effort of collecting this dataset. However, I found duplicate <user_id, item_id, time> records in the big matrix (e.g., user_id: 217, item_id: 3136, time: '2020-09-01 11:27:43.599'). Looking only at <user_id, item_id> pairs, a single user and a single video can have as many as 2224 records. Will future versions deal with this duplicate record issue?
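For anyone who wants to reproduce the check, here is a minimal pandas sketch of how such duplicates can be counted. It is only an illustration: `duplicate_summary` is a hypothetical helper, the DataFrame is a toy stand-in rather than the real big matrix, and the column names `user_id`, `item_id`, and `time` are assumed from the description above, so they may need adjusting to the actual file.

```python
import pandas as pd


def duplicate_summary(df, key_cols):
    """Return (rows repeating an earlier row on key_cols, size of the largest group)."""
    # duplicated() flags every occurrence after the first within each key group.
    n_duplicates = int(df.duplicated(subset=key_cols).sum())
    # Largest number of rows sharing one combination of key values.
    max_group = int(df.groupby(key_cols).size().max())
    return n_duplicates, max_group


# Toy data (not taken from the dataset, apart from the example key reported above).
toy = pd.DataFrame({
    "user_id": [217, 217, 217, 5],
    "item_id": [3136, 3136, 3136, 10],
    "time": [
        "2020-09-01 11:27:43.599",
        "2020-09-01 11:27:43.599",  # exact repeat of the first row
        "2020-09-02 08:00:00.000",  # same user and item, different time
        "2020-09-03 09:30:00.000",
    ],
})

exact = duplicate_summary(toy, ["user_id", "item_id", "time"])
pairs = duplicate_summary(toy, ["user_id", "item_id"])
print("exact <user_id, item_id, time> duplicates:", exact)
print("<user_id, item_id> repeats:", pairs)
```

If the exact repeats are unwanted, `df.drop_duplicates(subset=["user_id", "item_id", "time"])` would keep only the first row of each group. Whether repeated <user_id, item_id> pairs at different times should also be dropped depends on whether they are treated as genuine repeat views.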

Thanks!

Alice1998 • Sep 26 '22 14:09

@Alice1998 Thank you for your valuable feedback! I became aware of the issue upon your mention. After careful thought, I've decided to maintain the current version, as numerous researchers have already downloaded it, and I wish to avoid creating discrepancies. However, your suggestion is excellent! We will address this issue when we introduce additional features to the dataset. Your contributions are greatly appreciated! ^_^

chongminggao • Apr 30 '23 03:04