generative-recommenders
Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152, ICML...
When I see: B = all_timestamps.size(0); N = self._max_seq_len; t = F.pad(self._pos_w[:2 * N - 1], [0, N]).repeat(N); t = t[..., :-N].reshape(1, N, 3 * N - 2); r =...
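For readers puzzling over the same lines, here is a minimal sketch of what the pad / repeat / truncate / reshape sequence appears to compute. It uses plain Python lists rather than PyTorch tensors, and it is an illustration under assumptions, not the repository's code: the function name `toeplitz_from_weights` is hypothetical, and the final column slice is my own choice of how to finish the construction, since the quoted snippet is cut off before that step.

```python
def toeplitz_from_weights(w, n):
    """Return an n x n matrix M with M[i][k] == w[k - i + n - 1].

    Hypothetical helper: mirrors the pad -> repeat -> truncate -> reshape
    steps quoted in the issue, using lists instead of tensors.
    """
    if len(w) != 2 * n - 1:
        raise ValueError("expected 2 * n - 1 weights")

    # Pad n zeros on the right: length becomes 3n - 1.
    padded = list(w) + [0] * n
    # Tile n times, then drop the last n entries: length n * (3n - 2).
    flat = (padded * n)[:-n]
    # Reshape into n rows of width 3n - 2. The row width is one less than
    # the tiling period, so each row is shifted right by one position
    # relative to the row above it.
    width = 3 * n - 2
    rows = [flat[i * width:(i + 1) * width] for i in range(n)]
    # Assumed final step (not taken from the truncated snippet): keep the
    # n columns starting at offset n - 1, which yields a Toeplitz matrix
    # indexed by the relative offset k - i.
    return [row[n - 1:2 * n - 1] for row in rows]


weights = [10, 11, 12, 13, 14]  # 2n - 1 weights for n = 3
matrix = toeplitz_from_weights(weights, 3)
for row in matrix:
    print(row)
```

Each row of the result reads the same weight vector one position further to the left, so entry (i, k) depends only on the offset k - i. That is the usual shape of a relative-position bias table, built here without any explicit index arithmetic.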
Reviewed By: hanli0612 Differential Revision: D57754655
Differential Revision: D57292670
Reviewed By: hanli0612 Differential Revision: D57320475
https://github.com/facebookresearch/generative-recommenders/blob/main/data/dataset.py#L142 Hello, in the code here each item appears to represent a single token, but the paper says the input is formatted as item0, action0, item1, action1, and so on. What am I misunderstanding?
NOTE: This is an experiment, and a draft. Do not review. The following change requires a private patchset that is not yet available outside of https://github.com/plotfi/triton/pull/4 This patch adds usage...
Based on #57, this version uses the autotuner to toggle use of TMA.
The TMA version is added as a separate file. We have seen it improve perf in some cases and regress perf in others. We are also working on improving...
What compute configuration is required for training the GRs? With the experimental setup in the article, i.e. 256 H100s, how long should the model be trained?