firefox-translations-training
firefox-translations-training copied to clipboard
Training pipelines for Firefox Translations neural machine translation models
Let's make sure the infrastructure settings are adjusted based on the current needs of the pipeline. There shouldn't be any waste but at the same time infrastructure shouldn't limit experimentation...
Language pair: en-ru After training on a mix with back-translated data, it doesn't continue training on the original corpus. This is the same problem I investigated a couple of years...
We turned off spot instances for training tasks in #356. This issue tracks all the things we need to do before we can turn them back on. ```[tasklist] ### Blocks...
This is the lists of tasks that we need to handle in order to ramp up our ability to train many languages. These are things that break training runs, make...
This is a meta issue for tracking issues to improve the robustness of the translation models: ```[tasklist] ### Tasks - [ ] https://github.com/mozilla/firefox-translations-training/issues/514 - [ ] #215 - [ ]...
This is a meta issue for brainstorming ideas and tracking issues to improve the translation quality in general: ```[tasklist] ### General improvements - [ ] https://github.com/mozilla/firefox-translations-training/issues/843 - [ ] https://github.com/mozilla/firefox-translations-training/issues/178...
We identified a potential issue related to the expiration dates of task artifacts. Currently, all the logs and artifacts of a task are set to be deleted after 1 year....
#277 is adding support for easily configuring workers at decision task time. This makes it easy to use different sets of workers on different branches. However, we also want to...
It would be nice to optimize the end to end time. Here it is 1 hour and 25 minutes https://share.firefox.dev/3I192Z3 The training steps are the ones that take the longest...
When training languages it can be hard to know if alignments are generated correctly and behaving as expected. They can be visualized with a tool like [word-alignment-visualization](https://pypi.org/project/word-alignment-visualization/). During the alignment...