Video

Book

Spoken Language Processing: A Guide to Theory, Algorithm and System Development by Xuedong Huang (Author), Alex Acero (Author), Hsiao-Wuen Hon (Author) https://www.amazon.com/Spoken-Language-Processing-Algorithm-Development/dp/0130226165

Fundamentals of Speech Recognition by Lawrence Rabiner (Author), Biing-Hwang Juang (Author) https://www.amazon.com/Fundamentals-Speech-Recognition-Lawrence-Rabiner/dp/0130151572

Automatic Speech Recognition: A Deep Learning Approach (Signals and Communication Technology) 2015th Edition by Dong Yu (Author), Li Deng (Author) https://www.amazon.com/Automatic-Speech-Recognition-Communication-Technology/dp/1447157788

Speech and Language Processing, 2nd Edition by Daniel Jurafsky (Author), James H. Martin (Author) https://www.amazon.com/Speech-Language-Processing-Daniel-Jurafsky/dp/0131873210

Pattern Recognition and Machine Learning (Information Science and Statistics) by Christopher M. Bishop (Author) https://www.amazon.com/Pattern-Recognition-Learning-Information-Statistics/dp/0387310738

Wangd-kaldi-book http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/Wangd-kaldi-book

解析深度学习：语音识别实践 https://book.douban.com/subject/26820808/

Toolkit

Kaldi https://github.com/kaldi-asr

Eesen https://github.com/srvk/eesen

CNTK https://github.com/Microsoft/CNTK

Paper

L. R. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proceedings of the IEEE, vol. 77, no. 2, pp. 257–286, 1989

A. Graves, S. Fern´andez, F. Gomez, and J. Schmidhuber, “Connectionist temporal classiﬁcation: Labelling unsegmented sequence data with recurrent neural networks,” in International Conference on Machine Learning (ICML), ACM, 2006, pp. 369–376.

Reading list from NCMMSC Speech group http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/Reading_list_from_NCMMSC_Speech_group

Reference

https://www.zhihu.com/question/65516424/answer/232899728 https://www.zhihu.com/question/24342192/answer/225984574 https://www.zhihu.com/question/39701966/answer/88084026 https://www.msra.cn/zh-cn/news/features/book-recommendation-speech https://cloud.tencent.com/developer/article/1031646 https://book.douban.com/review/8658211/ https://blog.csdn.net/chenghaoy/article/details/82761586 http://ftli.farbox.com/post/automatic-speech-recognition-asr-courses http://zhaoshuaijiang.com/2019/02/15/end-to-end-asr/ https://antkillerfarm.github.io/speech/2018/04/16/speech.html

Voice-Tech-Study
Voice-Tech-Study copied to clipboard

Metadata

Video

Book

Toolkit

Paper

Reference

← Metadata

Owner

Metadata

Voice-Tech-Study Voice-Tech-Study copied to clipboard

Metadata

Video

Book

Toolkit

Paper

Reference

← Metadata

Owner

Metadata

Voice-Tech-Study
Voice-Tech-Study copied to clipboard