NYU-DLSP20 Self-Attention Paragraph Typos

Open PeppeSaccardi opened this issue 3 years ago • 2 comments

In the paragraph Self-Attention (I) of Week 12 / Attention and the Transformer, there is a small mistake right after the hidden layer is defined as the matrix multiplication $h = Xa$: the vector $a$ should belong to $\mathbb{R}^t$, not $\mathbb{R}^n$.
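
For reference, a quick dimension check. This assumes, as I read the notes (it is not restated in this issue), that $X \in \mathbb{R}^{n \times t}$ stacks the $t$ inputs $x_i \in \mathbb{R}^n$ as columns:

$$
h = Xa = \sum_{i=1}^{t} a_i x_i, \qquad X \in \mathbb{R}^{n \times t},\; a \in \mathbb{R}^{t} \;\Rightarrow\; h \in \mathbb{R}^{n}.
$$

Under that assumption the product is only defined when $a$ has $t$ entries, one per input, while $h$ is the vector that lives in $\mathbb{R}^n$.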

May 05 '21 10:05 PeppeSaccardi

Of course, thanks! Would you like to send a PR to fix this across all languages?

May 05 '21 23:05 Atcold

Of course! Thank you Alfredo!

May 06 '21 07:05 PeppeSaccardi
