Piotr Żelasko

Results 523 comments of Piotr Żelasko

Would it make sense to use a pure TDNN/TDNNF/CNN model for alignments? I was investing alignments from the conformer recently and my feeling was that they weren't perfect (even though...

I'll submit a PR with the code that allows computing alignments and visualizing them later. As to data augmentation of alignments, we could extend most transforms to handle it --...

Regarding this: it's actually weird that CTC and MMI alimdl would not make a difference. Some time ago, I think I looked at both CTC and MMI posteriors, and they...

FWIW I believe we were using the "cut concatenation" mechanism that packs multiple cuts+supervisions in a single "sequence" for several months now. IF there was an issue, I think all...

Since we have an ali model then maybe another option is to add frame wise cross entropy loss using that alignment, and apply the label smoothing there?

Could also be surprising if we want to try out a different optimizer that doesn't have Adam-like gradient scaling.

We might need to make the port configurable. For a quick work-around you can change it here: https://github.com/k2-fsa/snowfall/blob/master/snowfall/dist.py#L8

We can choose it randomly - although I think with `torch.distributed.launch` we'd have to choose it *outside* of the python script, and with `torch.distributed.spawn` we can choose it inside the...

Hmm, I've never seen this one before...

Oooh, now it all finally makes sense. Thanks for debugging this guys. I'll add a fix to the cut ids partitioning in the sampler.