Nikolay Bogoychev

Results 75 comments of Nikolay Bogoychev

@jerinphilip can models be swapped if they have different vocabulary sizes? If they have completely different configurations? We support users providing their own models and [people try to use this...

I guess you can hack it around by manually adding EoS as a post-processing step but max length 1.2 is too low and will create issues when there's a lot...

I just realised that this code was never tested properly on windows. Windows `std::string` are not unicode by default and I wonder if this code would just spit garbage on...

The reason for QT's behaviour is because various OS-s use different default locales and representation. mbrlen would definitely have different behaviour on linux and windows due to the string defaults.

I think that we shouldn't rely on snakemake caching to get it to work but it should be part of the pipeline with something like "train-reverse-model: true" appended at the...

```bash data/data/bg-en/snakemake-bg-en/original/eval$ ls custom-corpus_ devset.bg.gz devset.en.gz merge.bgen.gz ``` The merge is direction dependent.

I see, so i could in theory do a blanket copy of all the clean biclean etc directories and the only thing that would be rebuilt is the vocabulary (since...

I was just about to open the same bug report. +1

@kpu what's our rate limit? Also I noticed that if some jobs fail to download, the execution of the pipeline stops, but doing `make run-local` again, doesn't retry those failed...

Ideally we want the downloader to run in single thread, and the rest of the jobs to use the multithreaded configuration. At the moment I'm running the pipeline in one...