open-unmix-pytorch
Use of Transformers/Attention
Hi, I was just wondering whether there have been any attempts at replacing the Bi-LSTM with a Transformer, or at incorporating attention into the network, to potentially improve results?
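For context, a minimal sketch of what such a swap might look like. The LSTM configuration below mirrors the Open-Unmix defaults (3 bidirectional layers, `hidden_size=512`, sequence-first tensors); the Transformer hyperparameters (`nhead`, `dim_feedforward`, `dropout`) are illustrative assumptions, not something the repo provides:

```python
import torch
import torch.nn as nn

hidden_size = 512  # Open-Unmix default bottleneck width

# Recurrent core as configured in open-unmix-pytorch:
# bidirectional, so each direction gets hidden_size // 2.
lstm = nn.LSTM(
    input_size=hidden_size,
    hidden_size=hidden_size // 2,
    num_layers=3,
    bidirectional=True,
)

# A possible drop-in Transformer replacement (hyperparameters are guesses):
encoder_layer = nn.TransformerEncoderLayer(
    d_model=hidden_size,
    nhead=8,
    dim_feedforward=2048,
    dropout=0.1,
)
transformer = nn.TransformerEncoder(encoder_layer, num_layers=3)

# Both modules consume (nb_frames, nb_samples, features) and
# return a tensor of the same shape, so the surrounding
# skip connection and dense layers would be unaffected.
x = torch.randn(255, 16, hidden_size)
lstm_out, _ = lstm(x)        # -> (255, 16, hidden_size)
tf_out = transformer(x)      # -> (255, 16, hidden_size)
```

One caveat worth noting: without positional encodings the Transformer is permutation-invariant over frames, so a real experiment would likely need sinusoidal or learned positional embeddings added to `x` first.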