firefox-translations-training icon indicating copy to clipboard operation
firefox-translations-training copied to clipboard

Training pipelines for Firefox Translations neural machine translation models

Results 311 firefox-translations-training issues
Sort by recently updated
recently updated
newest added

Let's make sure the infrastructure settings are adjusted based on the current needs of the pipeline. There shouldn't be any waste but at the same time infrastructure shouldn't limit experimentation...

epic
cost & perf

Language pair: en-ru After training on a mix with back-translated data, it doesn't continue training on the original corpus. This is the same problem I investigated a couple of years...

quality

We turned off spot instances for training tasks in #356. This issue tracks all the things we need to do before we can turn them back on. ```[tasklist] ### Blocks...

epic
cost & perf

This is the lists of tasks that we need to handle in order to ramp up our ability to train many languages. These are things that break training runs, make...

meta

This is a meta issue for tracking issues to improve the robustness of the translation models: ```[tasklist] ### Tasks - [ ] https://github.com/mozilla/firefox-translations-training/issues/514 - [ ] #215 - [ ]...

meta
quality

This is a meta issue for brainstorming ideas and tracking issues to improve the translation quality in general: ```[tasklist] ### General improvements - [ ] https://github.com/mozilla/firefox-translations-training/issues/843 - [ ] https://github.com/mozilla/firefox-translations-training/issues/178...

meta
quality

We identified a potential issue related to the expiration dates of task artifacts. Currently, all the logs and artifacts of a task are set to be deleted after 1 year....

taskcluster
tc-p1

#277 is adding support for easily configuring workers at decision task time. This makes it easy to use different sets of workers on different branches. However, we also want to...

enhancement
taskcluster
on-prem

It would be nice to optimize the end to end time. Here it is 1 hour and 25 minutes https://share.firefox.dev/3I192Z3 The training steps are the ones that take the longest...

cost & perf

When training languages it can be hard to know if alignments are generated correctly and behaving as expected. They can be visualized with a tool like [word-alignment-visualization](https://pypi.org/project/word-alignment-visualization/). During the alignment...

enhancement