Piotr Żelasko
That's weird. Something went wrong when uploading. I'm pushing the missing files, you can expect them to be there in the next hour.
> @glynpu
> > @pzelasko counts from 1, not from 0. So you should use epoch-{7,8,9}.pt

We'll probably need to make the indexing consistent; different parts of the code base count...
This is a pretty feature-rich and efficient implementation of sub-word tokenizers (with training methods too) https://github.com/huggingface/tokenizers
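The core of training such a sub-word tokenizer is the BPE merge-learning loop. A toy illustration of that loop (this is *not* the `tokenizers` API, just a sketch of what the trainer does under the hood):

```python
from collections import Counter

def train_bpe(corpus, num_merges):
    """Learn BPE merge rules from a list of words (toy sketch)."""
    # Represent each word as a tuple of symbols (initially characters).
    vocab = Counter()
    for word in corpus:
        vocab[tuple(word)] += 1

    merges = []
    for _ in range(num_merges):
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        merges.append(best)
        # Apply the winning merge to every word in the vocabulary.
        merged = Counter()
        for symbols, freq in vocab.items():
            out, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    out.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            merged[tuple(out)] += freq
        vocab = merged
    return merges
```

For example, `train_bpe(["low", "lower", "lowest"], 2)` first merges `('l', 'o')` and then `('lo', 'w')`. The real library does the same thing much faster (Rust backend) and adds normalization, pre-tokenization, and special-token handling on top.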
Looks cool! My two cents: it's probably worth starting with an RNNLM and eventually trying some autoregressive transformers like GPT-2 (small/medium size).
> Also, I find the alignment information contained in the supervision is too simple

Can you describe the issue more? I'm not sure I understand what's missing there. We could...
BTW I wonder if we should support piping these programs together, Kaldi-style. Click easily allows doing that with [file type arguments](https://click.palletsprojects.com/en/8.0.x/arguments/#file-arguments). We could do that by writing/reading JSONL-serialized manifests in...
... there is also some code for line-by-line [incremental JSONL writing in Lhotse](https://github.com/lhotse-speech/lhotse/blob/master/lhotse/serialization.py#L189) that could be extended to support this.
Fair enough. The idea is to allow something like:

```
snowfall net compute-post - | snowfall net compute-ali -
```

but I just realized that with the current way things...
Agreed. But for the record, the full quote is actually: > "We should forget about small efficiencies, say about 97% of the time: premature optimization is the root of all...
I'd actually suggest returning these graphs directly from the Lhotse DataLoader to have a clear separation between data preparation and the rest of the training loop. Assuming they can be...