more question about dataset
When n = 10,I find more than 10% were removed out of a total of 817,741 adoptions and more than 10K item is remained, which is inconsistent with the data in the paper.Following is how to process the dataset .
sql = 'SELECT * FROM shopping_processing where product_ID in'
'(SELECT product_ID FROM shopping_processing group by product_ID having count(userID)>5)'
pandasql.sqldf(sql , globals())
Can you tell me where is wrong?Thanks.
When n = 10,I find more than 10% were removed out of a total of 817,741 adoptions and more than 10K item is remained, which is inconsistent with the data in the paper.Following is how to process the dataset . sql = 'SELECT * FROM shopping_processing where product_ID in' '(SELECT product_ID FROM shopping_processing group by product_ID having count(userID)>5)' pandasql.sqldf(sql , globals()) Can you tell me where is wrong?Thanks.
Has your problem been solved?