alimeituan

Results 169 comments of alimeituan

Hi, it looks like your data doesn't use underexpose_user_feat.csv? Was this file not provided by the organizers? `It includes another file named underexpose_user_feat.csv, the columns of which are: user_id, user_age_level, user_gender, user_city_level`

Hi, I have a question about get online_topk after setting offline ``` online_total_click = pd.DataFrame() for c in range(now_phase + 1): print('phase:', c) click_train = pd.read_csv('{}/{}-{}.csv'.format(online_train_path, train_file_prefix, c), header=None, names=['user_id', 'item_id', 'time']) phase_test_path = "{}/{}-{}".format(test_path, test_file_prefix, c) click_test =...

Hi, I'm confused about the normalization: ``` def process_item_feat(item_feat_df): processed_item_feat_df = item_feat_df.copy() # norm txt_item_feat_np = processed_item_feat_df[txt_dense_feat].values img_item_feat_np = processed_item_feat_df[img_dense_feat].values txt_item_feat_np = txt_item_feat_np / np.linalg.norm(txt_item_feat_np, axis=1, keepdims=True) img_item_feat_np = img_item_feat_np /...

I can see that. What I mean is that you normalize across rows (axis=1), but each column is a feature, so why not normalize across columns (axis=0)? Thanks a lot.
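To make the axis question concrete, here is a minimal sketch in plain Python of the two options. It is not the author's code: the helper names and the toy data are hypothetical, and the suggested motivation (unit-length item vectors make a dot product equal to cosine similarity) is an assumption about why per-row normalization is often chosen for embedding features, not something stated in the thread.

```python
import math


def l2_normalize_rows(matrix):
    """Scale each row to unit L2 length.

    NumPy equivalent: m / np.linalg.norm(m, axis=1, keepdims=True)
    Note: a row of all zeros would cause a division by zero here.
    """
    out = []
    for row in matrix:
        norm = math.sqrt(sum(x * x for x in row))
        out.append([x / norm for x in row])
    return out


def l2_normalize_cols(matrix):
    """Scale each column to unit L2 length.

    NumPy equivalent: m / np.linalg.norm(m, axis=0, keepdims=True)
    """
    n_cols = len(matrix[0])
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_cols)]
    return [[row[j] / norms[j] for j in range(n_cols)] for row in matrix]


def dot(u, v):
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


# Hypothetical toy data: 3 items, each with a 2-dimensional embedding.
item_vecs = [[3.0, 4.0], [6.0, 8.0], [4.0, -3.0]]

row_normed = l2_normalize_rows(item_vecs)
col_normed = l2_normalize_cols(item_vecs)

# With unit-length rows, the dot product of two items is their cosine similarity.
sim_same_direction = dot(row_normed[0], row_normed[1])  # items 0 and 1 are parallel
sim_orthogonal = dot(row_normed[0], row_normed[2])      # items 0 and 2 are perpendicular
```

Row normalization treats each item's vector as a whole and discards its length, which suits similarity search over embeddings. Column normalization rescales each feature across all items, which is the usual choice when columns are independent features on different scales.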

Could you explain what this part means? ``` def cal_occ(sentence): for i, word in enumerate(sentence): hist_len = len(sentence) co_occur_dict.setdefault(word, {}) for j in range(max(i - window, 0), min(i + window, hist_len)): if j == i or...
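The quoted function is cut off, so the following is not a reconstruction of it. It is a minimal sketch of the general pattern the visible lines suggest: counting how often two items appear within a fixed window of each other in a sequence. The names `count_cooccurrences` and `histories` are hypothetical, and the symmetric window used here is an assumption. The visible loop bound `min(i + window, hist_len)` stops one position earlier on the right-hand side.

```python
from collections import defaultdict


def count_cooccurrences(sequences, window=2):
    """Count how often two items appear within `window` positions of each other."""
    counts = defaultdict(lambda: defaultdict(int))
    for seq in sequences:
        for i, item in enumerate(seq):
            lo = max(i - window, 0)
            hi = min(i + window + 1, len(seq))  # +1 makes the window symmetric
            for j in range(lo, hi):
                if j == i:
                    continue  # do not pair an item with itself
                counts[item][seq[j]] += 1
    return counts


# Hypothetical click histories: one list of item ids per user.
histories = [["a", "b", "c", "d"], ["b", "c"]]
co = count_cooccurrences(histories, window=1)
```

Here `co["b"]["c"]` counts the times `c` appears next to `b` across all histories. A nested dictionary like this is a common starting point for item-to-item similarity scores.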

Hello, is this function meant to fill in items that lack txt or img features? `def fill_item_feat(processed_item_feat_df, item_content_vec_dict):` If every item already has these features, is the filling step unnecessary?

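Hello, can I put the data from phases 7, 8, and 9 together for prediction? That is, stop distinguishing between phases and recommend items to all users directly from the combined training set. Would that work?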

Hi, faiss is never imported, so why is there no error? That's really strange. How did you manage that? This is in the notebook file Rush_0615.ipynb.

Hi, have you tried tf-1.12+? Is that possible?

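Hmm, I tried it and the results were poor; regression doesn't work at all. This is a matter of quantifying people's subjective judgments, which needs a lot of fine-grained data, and that data has to be objective.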