Evgeny Pavlov

112 comments by Evgeny Pavlov

Please sync with main; it now includes a simple CI.

As a temporary workaround, you can increase retries (`--restart-times 3`), increase the number of threads for this job, or both. Parallelization is controlled with the `--cores all` option and...
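The two command-line flags mentioned above can be combined in one invocation. The following is a minimal sketch that only assembles the argument list; the helper name `build_snakemake_command` is hypothetical, and how these arguments reach Snakemake through the project's Makefile is not shown here and may differ.

```python
import shlex


def build_snakemake_command(restart_times=3, cores="all", extra_args=()):
    """Assemble a Snakemake command line with retries and parallelism set.

    Hypothetical helper for illustration; not part of the pipeline.
    """
    if restart_times < 0:
        raise ValueError("restart_times must be non-negative")
    command = [
        "snakemake",
        "--restart-times", str(restart_times),  # re-run a failed job up to N times
        "--cores", str(cores),                  # "all" uses every available core
    ]
    command.extend(extra_args)
    return command


command = build_snakemake_command()
print(shlex.join(command))
```

The per-job thread count is set in the workflow definition rather than on the command line, so it is left out of this sketch.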

@XapaJIaMnu could you please clarify how this will help?

This might be a bug in Marian. Memory shouldn't grow once `--shuffle-in-ram` is removed, and we should use the `--shuffle data` mode. This was discussed in https://github.com/mozilla/firefox-translations-training/pull/70#discussion_r800975032

Training teachers with `--shuffle batches` leads to such training curves. Maybe other factors are at play here.

Related Marian issue: https://github.com/marian-nmt/marian-dev/issues/148

Increasing the early-stopping thresholds can help, but it still does not fine-tune properly on some languages, I assume because of the low quality of the data.

```
training-teacher-base: # remove...
```

This issue happens because of a conflict with a preinstalled conda. We should modify the Makefile so that it always uses our mamba installation for snakemake.

> One open question here is what Snakemake does with non-0 exit codes of scripts. I am completely new to Snakemake ...

Snakemake deletes the job output that is specified...
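The behavior described in the reply can be modeled in a few lines. This is a sketch of the idea only, not Snakemake's implementation: it assumes the deletion applies to a job's declared outputs when its script exits with a non-zero code, as the quoted question implies. The `run_job` helper and the `model.bin` file name are hypothetical.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def run_job(command, outputs):
    """Run a command; on a non-zero exit code, delete the declared outputs.

    Returns True on success and False on failure.
    """
    result = subprocess.run(command)
    if result.returncode != 0:
        for path in outputs:
            # Discard output that may be incomplete or corrupted.
            Path(path).unlink(missing_ok=True)
        return False
    return True


with tempfile.TemporaryDirectory() as workdir:
    output = Path(workdir) / "model.bin"
    # Hypothetical failing script: writes partial output, then exits with code 1.
    script = "import sys; open(sys.argv[1], 'w').write('partial'); sys.exit(1)"
    ok = run_job([sys.executable, "-c", script, str(output)], [output])
    still_there = output.exists()
    print(ok, still_there)
```

Under this model, a script that fails midway leaves no partial file behind, so a later run cannot mistake it for a finished result.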

Also use `--disable_hardrules`, as suggested [here](https://github.com/mozilla/firefox-translations-training/issues/71#issuecomment-1040029744).