OPUS-MT-train icon indicating copy to clipboard operation
OPUS-MT-train copied to clipboard

Conversion of models based on BPE tokenizers to pytorch

Open SaricVr opened this issue 4 years ago • 2 comments

Hello,

Trying to convert the portuguese-english model to pytorch I noticed that this is not possible since the tokenizer is a BPE one. Is there a way of converting it? Or do you plan to release the spm version of such model at some point?

Thank you

SaricVr avatar Jan 29 '21 11:01 SaricVr

New models are on the way. I focus on models trained on Tatoeba-MT challenge data at the moment They will be listed here: https://github.com/Helsinki-NLP/Tatoeba-Challenge/blob/master/results/tatoeba-models-all.md

jorgtied avatar Feb 10 '21 14:02 jorgtied