sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android...

Results 419 sherpa-onnx issues

After generating a hotwords file of Chinese characters with the command-line tool, I can find the corresponding bytes in tokens.txt, but an error occurs when the file is used. ![微信图片_20240508112931](https://github.com/k2-fsa/sherpa-onnx/assets/166801697/a391aab4-3442-48ee-819e-8e80c56e3e39) ![微信图片_20240508113007](https://github.com/k2-fsa/sherpa-onnx/assets/166801697/cafcb26b-1f6d-4666-a070-28d037f1e2a1) ![微信图片_20240508113014](https://github.com/k2-fsa/sherpa-onnx/assets/166801697/c134ac7d-2183-4d6f-8682-a2b753ffab0c)

Following the [KWS documentation](https://k2-fsa.github.io/sherpa/onnx/kws/index.html) and `sherpa-onnx-keyword-spotter --help`, I ran the keyword-spotting command with the command-line tool. The output appears to contain only configuration information, and I see no recognition result. Because the program is a compiled binary, I cannot inspect it or print log messages. I don't know what to do next.

Command:

```
./build/bin/sherpa-onnx-keyword-spotter \
  --tokens=sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01/tokens.txt \
  --encoder=sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01/encoder-epoch-12-avg-2-chunk-16-left-64.onnx \
  --decoder=sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01/decoder-epoch-12-avg-2-chunk-16-left-64.onnx \
  --joiner=sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01/joiner-epoch-12-avg-2-chunk-16-left-64.onnx \
  --provider=cpu \
  --num-threads=2 \
  --keywords-file=sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01/test_wavs/test_keywords.txt \
  sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01/test_wavs/0.wav
```

Output log:

```
/home/taorui/sherpa-onnx/sherpa-onnx/csrc/parse-options.cc:Read:361 ./build/bin/sherpa-onnx-keyword-spotter --tokens=sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01/tokens.txt...
```

When I set the decoding method to 'modified_beam_search', it periodically outputs letters/words during silence, starting after a short time. With the default 'greedy_search' decoding method, everything works well. As an example...

I followed the instructions for building text-to-speech with WebAssembly: https://k2-fsa.github.io/sherpa/onnx/tts/wasm/index.html When I used English, as in the instructions, it worked well. But when I tried to use some...

Hi all, Would it be possible to add support for piping text into the TTS engine, using cat or a command-line parameter? That would help greatly when converting large...

Hi, This would be interesting and very beneficial for Windows users of screen readers and other speech-based software. Adding a SAPI 5 driver for text-to-speech to interface with...

Hi all, I just tested the TTS engine on Android 14 on a phone with a 64-bit processor. I also tested TTS with the TalkBack screen reader. It works very well...

Hi all, This seems to affect TTS in general. Tested on Android 14 and Windows 11. The generated speech seems to be very quiet and it seems to be under...
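
For readers hitting the same symptom, one possible client-side workaround is to rescale the generated samples before playback. The sketch below is not part of sherpa-onnx and is not taken from the issue. It assumes the samples are available as floats in the range [-1, 1]; `peak_normalize` and `target_peak` are hypothetical names chosen for illustration.

```python
def peak_normalize(samples, target_peak=0.9):
    """Scale samples so the largest absolute value equals target_peak.

    Hypothetical helper, not a sherpa-onnx API. Assumes `samples` is a
    sequence of floats in [-1, 1].
    """
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        # Pure silence (or empty input): there is nothing to scale.
        return list(samples)
    gain = target_peak / peak
    return [s * gain for s in samples]
```

Peak normalization applies a single gain to the whole utterance, so it raises the level without changing the waveform shape. It does not address whatever causes the low level in the first place.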

Hi all, It would be nice to have this feature so that audio can be piped directly into, for example, ffmpeg or lame. So generating a...
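
Until such a feature exists, a small wrapper script could approximate the request above. This is a minimal sketch that uses only the Python standard library. It assumes the synthesized audio is available as mono float samples in [-1, 1], and the function names `float_to_wav_bytes` and `write_wav_to_stdout` are hypothetical, not sherpa-onnx APIs.

```python
import io
import struct
import sys
import wave


def float_to_wav_bytes(samples, sample_rate):
    """Encode mono float samples in [-1, 1] as a 16-bit PCM WAV file in memory."""
    pcm = bytearray()
    for s in samples:
        s = max(-1.0, min(1.0, s))  # clip to the valid range
        pcm += struct.pack("<h", int(round(s * 32767)))
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)  # mono
        w.setsampwidth(2)  # 16-bit samples
        w.setframerate(sample_rate)
        w.writeframes(bytes(pcm))
    return buf.getvalue()


def write_wav_to_stdout(samples, sample_rate):
    """Write the encoded WAV to standard output so it can be piped onward."""
    sys.stdout.buffer.write(float_to_wav_bytes(samples, sample_rate))
    sys.stdout.buffer.flush()
```

A script that calls `write_wav_to_stdout` with the generated samples could then feed a downstream encoder, for example `python script.py | ffmpeg -i pipe:0 out.mp3`. Building the file in memory first avoids seeking on a pipe, at the cost of holding the whole utterance in RAM.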