
lstm-transducer model for Chinese data

Open lucasjinreal opened this issue 3 years ago • 5 comments


Is there an LSTM-transducer example for any Chinese dataset?

lucasjinreal avatar Sep 22 '22 07:09 lucasjinreal

Yes, please see

https://k2-fsa.github.io/sherpa/python/streaming_asr/lstm/index.html#chinese

csukuangfj avatar Sep 22 '22 07:09 csukuangfj

And here is the demo using the above Chinese model.

https://k2-fsa.github.io/sherpa/python/streaming_asr/endpointing.html#endpointing-demo-chinese

csukuangfj avatar Sep 22 '22 07:09 csukuangfj

Is there an LSTM-transducer example for any Chinese dataset?

We will upload the code soon.

csukuangfj avatar Sep 22 '22 07:09 csukuangfj

OK, looking forward to it. BTW, does the WenetSpeech dataset currently give the most accurate model for everyday speech recognition?

lucasjinreal avatar Sep 22 '22 07:09 lucasjinreal

To the best of my knowledge, WenetSpeech is the largest open-source Chinese dataset so far.

csukuangfj avatar Sep 22 '22 07:09 csukuangfj