respeaker_python_library icon indicating copy to clipboard operation
respeaker_python_library copied to clipboard

Wakeup words support for Chinese

Open juncaofish opened this issue 8 years ago • 13 comments

Hello, Just need a local support feature for voice wakeup~ I notice this file respeaker/pocketsphinx-data/dictionary.txt . So my question is if I want to set a customised wakeup word in Chinese, like "魔镜", can I add to the dictionary something like 'mojing' M UE J IN ? Thanks.

juncaofish avatar Nov 28 '16 03:11 juncaofish

To recognize Chinese words, you need another acoustic model, for example https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/Mandarin/.

But the mandarin acoustic model is not good enough. You may need some training for the model.

xiongyihui avatar Nov 28 '16 04:11 xiongyihui

Thanks for replying. So is MT7688 capable enough to run the mandarin acoustic model?

juncaofish avatar Nov 28 '16 05:11 juncaofish

The mandarin acoustic model is too big for MT7688 to do continuous speech recognizer. You need a small one.

xiongyihui avatar Nov 28 '16 08:11 xiongyihui

I find a tool https://github.com/Kitt-AI/snowboy which seems to be a perfect solution for Chinese hotword local detection. However, it seems it cannot be targeted to MT7688 either for the hardware architecture limit. Do you have any suggestions on how to take less effort for Chinese hotword detection task? Thanks very much.

juncaofish avatar Nov 28 '16 08:11 juncaofish

It seems the snowboy is the most accurate. Ask for kitt-ai to compile a MIPS version of snowboy maybe work. There are old reversions of mandrarin acoustic model in cmusphinx repo which is smaller and should be able to run on MT7688, or train a smaller model by hand.

xiongyihui avatar Nov 29 '16 12:11 xiongyihui

I'm afraid the MT7688 is not a supported target for snowboy. Take a reference at discussion here: https://groups.google.com/a/kitt.ai/forum/#!topic/snowboy-discussion/iL4rRZRUk4U

juncaofish avatar Nov 30 '16 09:11 juncaofish

is it possible to use Xunfei wakeup on Respeaker, I mean compile the sdk for 7688's mips platform? It seems like a good solution if cost is not considered as a cons.

juncaofish avatar Dec 01 '16 08:12 juncaofish

I applied a sdk of xunfei's stt and stt running on 7688, it worded. The wakeup sdk should work too.

xiongyihui avatar Dec 01 '16 08:12 xiongyihui

Since I have tried the online sdk on Raspberry Pi, I guess you mean the online tts/stt service of xunfei. Do you need to submit the compile chain to their engineer? I remember the sdk only provides .so for x86/x64 .

juncaofish avatar Dec 01 '16 09:12 juncaofish

Yes, for c sdk for 7688, we need to submit the toolchain.

xiongyihui avatar Dec 01 '16 10:12 xiongyihui

I am intersted in using xunfei's wakeup sdk.... i have download the pack from there, and how can i use it in respeaker?

calvinwongsc avatar Feb 21 '17 09:02 calvinwongsc

yihui 你好 我在mac上执行bing_translator.py的时候,提示DEBUG:urllib3.connectionpool:https://oxford-speech.cloudapp.net:443 "POST /token/issueToken HTTP/1.1" 401 143 00000000000recognition connection failed,我已经提供了申请的密钥 1: ,这个错误是什么问题,国内网络限制吗?

jiaoxu avatar Aug 03 '17 03:08 jiaoxu

Is there any way to actually run snowboy detect on 7688, if so how? Thanks

solidssss avatar Mar 14 '21 17:03 solidssss