Revo comments

Repositories
Issues
Comments

Results 5 comments of


                                            Revo

MPI-training seems not working in the current version.

will multi-node MPI trianing be faster than 8 gpus + NCCL?

XML Markup Scheme for Marian Decoder

@emjotde 1. I had tried the case "only overwrite the target token". I chosed using attention matrix to do replacement. ( En->Fr, En : I played `FOOTBALL,` Fr : j'ai...

Pre-layer_normalize with deep depth model is not working in current version

@emjotde Yes. It is happening for all data-sets`(WMT17 zh-en, WMT14 de-en, CCMT2020 zh-en)`, here i show you my older version run.me script and its training log. P.S. : Actually i...

Fixes #367 adding silent timeout functionality

> We are interested by this functionality. @iandewancker what is the status on your side, do you have some spare time to work on this ? > > @AshBT note...

275 seconds per step

I use 1080 Ti for training 7 sec to 12 sec samples only takes me 1.5 sec/step. Maybe you didn't use your GPU?