MoChA-pytorch icon indicating copy to clipboard operation
MoChA-pytorch copied to clipboard

PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)

PyTorch Implementation of Monotonic Chunkwise Attention

Requirements

  • PyTorch 0.4

TODOs

  • [x] Soft MoChA
  • [x] Hard MoChA
  • [ ] Linear Time Decoding
  • [ ] Experiment with Real-world dataset

Model figure

Model figure 1

Linear Time Decoding

It's not clear if authors' TF implementation supports decoding in linear time. They calculate energies for whole encoder outputs instead of scanning from previously attended encoder output.

References