molokanov50 issues

Results: 9 issues of molokanov50

Upload a model only once & translate in any lang pair as much as possible

Hello team, Based on a trained multilingual Fairseq model (e.g. M2M-100), I run my translations as a service in a Docker container according to the following scheme: as an input...

question

needs triage

Why are only one or two first sentences translated?

Hello team, I'm trying to translate long texts of 6-8 sentences that do not exceed 100 tokens in total (so as not to exceed the model's memory limits)...

Parallel (batched) translation from different source languages

Hi team, The ability to translate in parallel (in a single batch) from different source languages is of particular interest. The current obstacle is that the tokenizer...

question

needs triage

Finetuning pretrained translation model on new vocabulary

Hi. My goal is to finetune a large BERT-based MT model (e.g. `NLLB-200-1.3B`) on new words that are outside the model's vocabulary. I managed to finetune it only from a...

question

needs triage

Different LID results for uppercase and lowercase texts

The LID model presented here (`lid218e.bin`) behaves incorrectly. In particular: `Как сообщает пресса` is identified as `rus_Cyrl`, which is correct, but `КАК СООБЩАЕТ ПРЕССА` is identified as `eng_Latn`; `Добрый...

bug

needs triage

LID model - list of supported languages and source code

Hello, Where can I find the list of languages the LID model can identify? It seems that only FastText's list of 157 languages is published, but it is an older version...

question

needs triage

Implementation in a loop clogs up memory

Filtering pipeline produces a config with wrong lang directions

TypeError: load() missing 1 required positional argument: 'Loader'