sparks-heng

Results 1 comments of sparks-heng

yes, I have the same question. As the paper (Attention is all you need) says: ``` pe[pos, 2i] = math.sin(pos / (10000 ** ((2 * i) / d_model))) pe[pos, 2i...