sherpa-onnx
sherpa-onnx copied to clipboard
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android...
Hello, is it possible to ise scripts for nemo model conversion to convert this model? https://huggingface.co/nvidia/canary-1b I'm asking because it supports put of the mox also translation and other tasks...
那样就可以在web端达到实时了 /(ㄒoㄒ)/~~
我在使用基于CTC解码的模型做语音识别,例如sensevoice,很多识别结果发音正确,但不是我想要的字,请问后续对于hotwords的方案有计划支持么?
On my Fairphone 4, the F-Droid version of SherpaTTS crashes when I try to open it. The logcat shows lots of errors like this one: ``` 02-05 19:32:56.948 22822 22822...
Currently the WASM models are extremely painful to get working once you start integrating the example code you get from downloading the WASM releases. We had to rewrite `app-vad-asr.js` entirely...
We can replicate it from https://github.com/thewh1teagle/kokoro-onnx
``` System.DllNotFoundException: Unable to load shared library '/home/zhenyapav/Projects/test/data_AI_Desktop_Companion_linuxbsd_x86_64/libsherpa-onnx-c-api.so' or one of its dependencies. In order to help diagnose loading problems, consider using a tool like strace. If you're using glibc,...
I wanna build the application for speaker identification using sherpa-onnx on flutter but when extract the embedding audio, it given the empty list. I already debug to check that the...
崩溃堆栈如下 #0 0x000000010fa08f4c in sherpa_onnx::Clone () #1 0x000000010fa0283c in sherpa_onnx::OnlineZipformer2TransducerModel::StackStates () #2 0x000000010f9747fc in sherpa_onnx::KeywordSpotterTransducerImpl::DecodeStreams () #3 0x000000010f952c0c in SherpaOnnxDecodeKeywordStream ()
使用维基百科文本 训练一个更好的TTS模型 https://huggingface.co/datasets/wikimedia/wikipedia https://dumps.wikimedia.org/backup-index.html