Synthesizer
Synthesizer copied to clipboard
Why does encoder still use vanilla dot-product self-attention?
Here the synthesizer decoder uses dot-product self-attention for encoder-decoder attention. Is that correct?