MoChA-pytorch
MoChA-pytorch copied to clipboard

j-min

→

Metadata

PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)

Readme
Issues

PyTorch Implementation of Monotonic Chunkwise Attention

Requirements

PyTorch 0.4

TODOs

[x] Soft MoChA
[x] Hard MoChA
[ ] Linear Time Decoding
[ ] Experiment with Real-world dataset

Model figure

Model figure 1

Linear Time Decoding

It's not clear if authors' TF implementation supports decoding in linear time. They calculate energies for whole encoder outputs instead of scanning from previously attended encoder output.

References

Colin Raffel, Minh-Thang Luong, Peter J. Liu, Ron J. Weiss and Douglas Eck. Online and Linear-Time Attention by Enforcing Monotonic Alignments (ICML 2017)
Chung-Cheng Chiu and Colin Raffel. Monotonic Chunkwise Attention (ICLR 2018)

About

PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)

pytorch

attention-mechanism

seq2seq

hard-attention

monotonic-attention

75

Stars

19

Forks

Watchers

Owner

j-min

← Metadata

75

Stars

19

Forks

Watchers

Owner

j-min

Metadata

PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)

Back

MoChA-pytorch MoChA-pytorch copied to clipboard

Metadata

PyTorch Implementation of Monotonic Chunkwise Attention

Requirements

TODOs

Model figure

Linear Time Decoding

References

← Metadata

Owner

Metadata

MoChA-pytorch
MoChA-pytorch copied to clipboard