Voice-Tech-Study
Voice-Tech-Study copied to clipboard
语音识别 语音前端处理 语音合成 语音转换等等语音技术的资料汇总
Video
Book
Spoken Language Processing: A Guide to Theory, Algorithm and System Development by Xuedong Huang (Author), Alex Acero (Author), Hsiao-Wuen Hon (Author) https://www.amazon.com/Spoken-Language-Processing-Algorithm-Development/dp/0130226165
Fundamentals of Speech Recognition by Lawrence Rabiner (Author), Biing-Hwang Juang (Author) https://www.amazon.com/Fundamentals-Speech-Recognition-Lawrence-Rabiner/dp/0130151572
Automatic Speech Recognition: A Deep Learning Approach (Signals and Communication Technology) 2015th Edition by Dong Yu (Author), Li Deng (Author) https://www.amazon.com/Automatic-Speech-Recognition-Communication-Technology/dp/1447157788
Speech and Language Processing, 2nd Edition by Daniel Jurafsky (Author), James H. Martin (Author) https://www.amazon.com/Speech-Language-Processing-Daniel-Jurafsky/dp/0131873210
Pattern Recognition and Machine Learning (Information Science and Statistics) by Christopher M. Bishop (Author) https://www.amazon.com/Pattern-Recognition-Learning-Information-Statistics/dp/0387310738
Wangd-kaldi-book http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/Wangd-kaldi-book
解析深度学习:语音识别实践 https://book.douban.com/subject/26820808/
Toolkit
Kaldi https://github.com/kaldi-asr
Eesen https://github.com/srvk/eesen
CNTK https://github.com/Microsoft/CNTK
Paper
L. R. Rabiner, “A tutorial on hidden Markov models and selected applications in speech recognition,” Proceedings of the IEEE, vol. 77, no. 2, pp. 257–286, 1989
A. Graves, S. Fern´andez, F. Gomez, and J. Schmidhuber, “Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks,” in International Conference on Machine Learning (ICML), ACM, 2006, pp. 369–376.
Reading list from NCMMSC Speech group http://cslt.riit.tsinghua.edu.cn/mediawiki/index.php/Reading_list_from_NCMMSC_Speech_group
Reference
https://www.zhihu.com/question/65516424/answer/232899728 https://www.zhihu.com/question/24342192/answer/225984574 https://www.zhihu.com/question/39701966/answer/88084026 https://www.msra.cn/zh-cn/news/features/book-recommendation-speech https://cloud.tencent.com/developer/article/1031646 https://book.douban.com/review/8658211/ https://blog.csdn.net/chenghaoy/article/details/82761586 http://ftli.farbox.com/post/automatic-speech-recognition-asr-courses http://zhaoshuaijiang.com/2019/02/15/end-to-end-asr/ https://antkillerfarm.github.io/speech/2018/04/16/speech.html