Transformer-DyNet icon indicating copy to clipboard operation
Transformer-DyNet copied to clipboard

An Implementation of Transformer (Attention Is All You Need) in DyNet

Results 3 Transformer-DyNet issues
Sort by recently updated
recently updated
newest added

This is a great repo. Can this code support multi-GPU training? I wonder if it can achieve the same performance as tensor2tensor on wmt14-en-de corpus. Thanks.

when i run that command ./build_gpu/transformer-decode --model-path model/iwslt-envi --beam 5 -T sample-data/tst2013.en.capped i got that error: .... Reading test examples from sample-data/tst2013.en.capped Segmentation fault (core dumped) ... I find many...

rebase of my work on dynet to get it to compile with c++20