Binbin Zhang comments

Results: 133 comments of Binbin Zhang

Shorter decoding segments when using server/x86

A mismatch between training and inference might be the problem. How is the rescoring accuracy?

Auto-generated encoder model config

@Slyne, please review.

Can I get timestamp info by GPU inference?

@yuekaizhang, is it possible?

Can I get timestamp info by GPU inference?

The binding is CPU-only and too heavy for this task. I prefer to use torchaudio's built-in decoder.

Using shard data type for LibriSpeech, I got errors "WARNING error to parse"

What is your PyTorch/torchaudio version?
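
One way to answer the version question without importing the libraries themselves is to read the installed package metadata. This is a minimal sketch; the helper name `installed_version` is hypothetical, and it assumes the packages were installed under their usual PyPI names, `torch` and `torchaudio`.

```python
from importlib import metadata


def installed_version(package: str) -> str:
    """Return the installed version of a package, or a marker if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return "not installed"


# Report both versions so they can be pasted into the issue.
for name in ("torch", "torchaudio"):
    print(f"{name}: {installed_version(name)}")
```

Running this inside the same virtual environment used for training shows the versions that environment actually sees.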

Exec export_onnx_cpu.py error

@xingchensong, please follow up on this issue.

dataloader error

Do you have the right Python virtual environment activated? Your command line shows that `espnet` is being used.

Adding language model error rate increases rapidly

It's weird. Please check: 1. words.txt: for decoding with an LM, you should use the words.txt generated by the LM tools. Do not use the words.txt which is used for...

Adding language model error rate increases rapidly

We have only tested it on a 200+ hour dataset. However, I think something must be wrong in your pipeline, since the WER is greater than 100%.
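
For context on how a WER can exceed 100%: WER is the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words, so a hypothesis with many inserted words pushes it above 1.0. The sketch below is a plain illustration of that definition, not the scoring tool used in the issue; the function name `word_error_rate` is hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Four inserted words against a two-word reference give a WER of 200%.
print(word_error_rate("hello world", "hello big wide old world again"))
```

A result like this usually points to a systematic mismatch, such as the wrong vocabulary file, rather than ordinary recognition errors.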

runtime memory leak

Is there any solution to fix it?