Transformer-DyNet
Transformer-DyNet copied to clipboard
Does support multi-gpus training ?
This is a great repo. Can this code support multi-GPU training? I wonder if it can achieve the same performance as tensor2tensor on wmt14-en-de corpus. Thanks.