Piotr Żelasko

523 comments

I'll look into it. I'm also looking at other aspects of the recipe - e.g. we're currently using position-dependent phones, so we're getting 4x the number of output symbols...
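For context, a minimal sketch (illustrative phone names, not the actual snowfall lexicon code) of why position-dependent phones roughly quadruple the output symbol inventory: each base phone gets a Kaldi-style word-position suffix (`_B`, `_I`, `_E`, `_S`).

```python
# Hypothetical example: position-dependent phones multiply the symbol set by ~4.
from itertools import product

base_phones = ["AA", "AE", "B", "K"]   # illustrative phone inventory
positions = ["_B", "_I", "_E", "_S"]   # word-begin / internal / end / singleton

position_dependent = [p + s for p, s in product(base_phones, positions)]
print(len(base_phones), "->", len(position_dependent))  # 4 -> 16, i.e. 4x

def strip_position(phone: str) -> str:
    """Map a position-dependent phone back to its position-independent form."""
    for suffix in positions:
        if phone.endswith(suffix):
            return phone[: -len(suffix)]
    return phone

assert strip_position("AA_B") == "AA"
```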

I changed the phones to position-independent, and here's what an example of the posteriors looks like in an unmodified model (the first is the "as-is" output, the other one is...
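For reference, a rough sketch of how such a posterior heatmap can be produced; the forward call and output shape below are assumptions for illustration, not the actual snowfall model signature.

```python
import torch
import matplotlib.pyplot as plt

@torch.no_grad()
def plot_posteriors(model: torch.nn.Module, features: torch.Tensor) -> None:
    """features: (1, num_frames, feat_dim) batch with a single utterance (assumed shape)."""
    model.eval()
    nnet_output = model(features)                # assumed to return (1, T, num_symbols)
    posteriors = nnet_output.softmax(dim=-1)[0]  # (T, num_symbols)
    plt.imshow(posteriors.T.cpu().numpy(), aspect="auto", origin="lower")
    plt.xlabel("frame")
    plt.ylabel("output symbol")
    plt.colorbar()
    plt.show()
```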

(this is in the middle of training, i.e. checkpoint from epoch 5)

Update: never mind, it was not the latest k2, and the WER for that model is still 99%. Almost all the hypotheses are empty texts. Will keep looking.

FYI I ran the full 960h LibriSpeech training (with speed perturbation) with the CTC graph; the WER is:
```
2021-01-03 08:53:22,337 INFO [decode.py:217] %WER 10.05% [5285 / 52576, 725 ins,...
```
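For anyone parsing the log line above, a small sketch of how the Kaldi-style %WER figure is derived: the 5285 total errors and 52576 reference words come from the log, while the full insertion/deletion/substitution breakdown is cut off in the excerpt.

```python
# %WER = 100 * (insertions + deletions + substitutions) / reference words.
# Only the total error count and reference word count are used here.
def wer_percent(total_errors: int, ref_words: int) -> float:
    return 100.0 * total_errors / ref_words

print(f"%WER {wer_percent(5285, 52576):.2f}%")  # -> %WER 10.05%
```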

Cool! In that case I'll re-attempt this.

@zhu-han I tried again with your fix, but I'm still getting the following error:
```
File "./mmi_att_transformer_train.py", line 104, in get_objf
  nnet_output, encoder_memory, memory_mask = model(feature, supervision_segments)
File "/home/hltcoe/pzelasko/miniconda3/envs/k2env/lib/python3.7/site-packages/torch/nn/modules/module.py", line...
```

You were right about that issue in Lhotse - the musan mixing code sometimes truncated too much of the original utterance. I fixed it (gonna merge as soon as the...
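To illustrate the behaviour the fix restores, here is a conceptual sketch with a hypothetical helper (not the actual Lhotse code): mixing noise into an utterance should never shorten the utterance itself.

```python
import numpy as np

def mix_with_noise(speech: np.ndarray, noise: np.ndarray, snr_db: float = 10.0) -> np.ndarray:
    """Add `noise` to `speech` without shortening `speech` (hypothetical helper)."""
    # Tile or trim the noise to match the speech length, never the other way around.
    if len(noise) < len(speech):
        reps = int(np.ceil(len(speech) / len(noise)))
        noise = np.tile(noise, reps)
    noise = noise[: len(speech)]
    # Scale the noise to the requested SNR.
    speech_energy = np.mean(speech ** 2) + 1e-10
    noise_energy = np.mean(noise ** 2) + 1e-10
    gain = np.sqrt(speech_energy / (noise_energy * 10 ** (snr_db / 10)))
    mixed = speech + gain * noise
    assert len(mixed) == len(speech)  # the original utterance is never truncated
    return mixed
```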

To make things easier, I confirmed that the issue does not arise regardless of the `duration_factor` setting in the LSTM recipe (`mmi_bigram_train.py`).