DeepMatch-Torch issues

Two Bugs regarding YoutubeDNN

2

There are two bugs related to the codes for YoutubeDNN model. 1. The gen_data_set_youteube has a typo... should be youtube. (Not necessarily a bug lol) 2. Here's the *first* bug:...

NeverSalar

question

DSSM有线上测试过效果吗

3

如题，想请教下这些召回模型有实际上线验证码，比如DSSM

skriser

documentation

为什么pytorch_lightning不在setup.py里？

2

在linux安装的时候没法直接跑

ajing

documentation

### Discussed in https://github.com/bbruceyuan/DeepMatch-Torch/discussions/4 Originally posted by **xbingsun** May 25, 2022 您好，我在mind实现代码中看到调用 input_from_feature_columns 函数得到用户的历史行为的表示，而在该函数中使用了get_varlen_pooling_list对用户的行为进行了pooling操作，想要请问这样做的原因是什么呢？好像原版本的deepmatch中也使用了这个操作，根据原论文胶囊层输入应该是各个items的embeddings（如下图）。

xbingsun

bug

在创建用户特征的时候，生成的历史观影数据长度的声明前后不一致？

5

在run_youtubednn.py中，`SEQ_LEN = 50`，但是在创建`user_feature_columns`这一个变量的时候，声明的历史电影长度为10，这会影响模型的正确运行么？ `VarLenSparseFeat(SparseFeat('hist_movie_id', vocabulary_size=feature_max_idx['movie_id'], embedding_dim=embedding_dim, embedding_name="movie_id"), maxlen=10, combiner='mean')` 在创建item tower的时候，X维度为（BatchSize,61） (1+1+1+1+1+50+6=61)，当取item的id的时候，`self.feature_index['movie_id']`为[15:21]，这会导致不能正确地按照维度取值？ ![image](https://github.com/bbruceyuan/DeepMatch-Torch/assets/54670610/02665281-77e0-4ec5-8e12-876746e7f0ac)

areylng

question

DSSM召回率等指标特别低

2

感谢楼主的工作，但是我在用电影数据集测试dssm时发现指标特别低，0.014的样子。我想问一下应该怎样提升指标

rurubaobao

question

DeepMatch-Torch
DeepMatch-Torch copied to clipboard

Metadata

Two Bugs regarding YoutubeDNN

DSSM有线上测试过效果吗

为什么pytorch_lightning不在setup.py里？

关于mind代码中对用户行为的pooling操作

在创建用户特征的时候，生成的历史观影数据长度的声明前后不一致？

DSSM召回率等指标特别低

← Metadata

Owner

Metadata

DeepMatch-Torch DeepMatch-Torch copied to clipboard

Metadata

Two Bugs regarding YoutubeDNN

DSSM有线上测试过效果吗

为什么pytorch_lightning不在setup.py里？

关于mind代码中对用户行为的pooling操作

在创建用户特征的时候，生成的历史观影数据长度的声明前后不一致？

DSSM召回率等指标特别低

← Metadata

Owner

Metadata

DeepMatch-Torch
DeepMatch-Torch copied to clipboard