flowseq
Training time and distillation
Hi, thanks for sharing your code. How many steps, or how much training time, does it take to train the FlowSeq model on WMT14 EN-DE? Also, will you release the distillation dataset? It would help us reproduce your results.