More questions about the dataset

Open SSOTTY opened this issue 5 years ago • 1 comments

With n = 10, I find that more than 10% of the 817,741 total adoptions are removed and more than 10K items remain, which is inconsistent with the numbers in the paper. This is how I process the dataset:

    sql = ('SELECT * FROM shopping_processing where product_ID in '
           '(SELECT product_ID FROM shopping_processing group by product_ID having count(userID)>5)')
    pandasql.sqldf(sql, globals())

Can you tell me what is wrong? Thanks.
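For comparison, the same filter can be sketched directly in pandas, which makes the threshold explicit. This is only a minimal sketch: the helper name `filter_items_by_adoptions` and the toy data are hypothetical, and the column names `product_ID` and `userID` are taken from the query above. Note that the query keeps items with `count(userID) > 5` (at least 6 adoptions), whereas the text mentions n = 10; whether the paper's rule is `>= n` or `> n` is an assumption that should be checked against the paper.

```python
import pandas as pd


def filter_items_by_adoptions(df, n, item_col="product_ID", user_col="userID"):
    """Keep only rows whose item has at least n non-null user entries.

    Hypothetical helper: the ">= n" rule is an assumption, not the paper's
    confirmed preprocessing. Passing n=6 reproduces "count(userID) > 5".
    """
    # Per-row count of non-null users for that row's item
    counts = df.groupby(item_col)[user_col].transform("count")
    return df[counts >= n]


# Toy data (made up for illustration): item "a" has 3 adoptions, "b" has 1
toy = pd.DataFrame({
    "userID": [1, 2, 3, 4],
    "product_ID": ["a", "a", "a", "b"],
})
kept = filter_items_by_adoptions(toy, n=2)

# Share of adoptions removed and number of items remaining
removed_share = 1 - len(kept) / len(toy)
items_remaining = kept["product_ID"].nunique()
```

Running this on the real table with different values of `n` would show how sensitive the removed share and the remaining item count are to the threshold.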

Jun 22 '20 04:06 SSOTTY

Has your problem been solved?

Nov 15 '21 09:11 zf-shao