sherpa-onnx icon indicating copy to clipboard operation
sherpa-onnx copied to clipboard

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android...

Results 419 sherpa-onnx issues
Sort by recently updated
recently updated
newest added

![Image](https://github.com/user-attachments/assets/f4973ed0-3abf-44fa-a8ab-581d7e951695) Lack of support for sense_voice.

Hi I want to know is it support word-level timestamps for text to speech? Thanks

我看到huggingface的TTS演示案例中,”select a model”的下拉选项中标注了Speakers的数量,我想用C#复现一个这种列表,但是依次初始化每个模型再获取NumSpeakers的效率太低了。 我通过FileStream只读取模型文件最后1K的字节,再匹配“n_speakers”前缀和“r”后缀截取Speakers数量,目前来看是似乎是能做到快速读取到每个模型的Speakers数量,但是我不知道是否有必要这么做,也不知道这么做能不能适配每个模型

Delay is a major drawback of sherpa tts on android phones, which makes it unusable for average blind people using average phones. By caching the most frequent tts requests, sherpa...

Exception: Managed Debugging Assistant 'PInvokeStackImbalance' Message=Managed Debugging Assistant 'PInvokeStackImbalance' : 'A call to PInvoke function 'sherpa-onnx!SherpaOnnx.KeywordSpotter::SherpaOnnxCreateKeywordSpotter' has unbalanced the stack. This is likely because the managed PInvoke signature does not...

实时的朗读跟随算是小说阅读app的基本功能之一,期望作者大大支持。

Hello In the implementation for objects like OfflineTtsGeneratedAudio, I notice that there is a Dispose method but the class does not implement the IDisposable interface. This means we must manually...

你好,我使用sherpa-onnx-1.10.43的代码,编译riscv64平台时遇到以下问题 /usr/bin/cmake: /usr/local/lib/libcurl.so.4: no version information available (required by /usr/bin/cmake) /opt/toolschain/zam70/riscv64-linux-musl-x86_64/bin/riscv64-unknown-linux-musl-g++ -Wl,-rpath='/opt/toolschain/zam70/riscv64-linux-musl-x86_64/sysroot/lib' -mcpu=c906fdv -march=rv64imafdcv0p7xthead -mcmodel=medany -mabi=lp64d -O3 -DNDEBUG -flto -fno-fat-lto-objects CMakeFiles/sherpa-onnx.dir/sherpa-onnx.cc.o -o ../../bin/sherpa-onnx -Wl,-rpath,"\$ORIGIN:/home/nongbojian/workcode/numbers/kokoro/sherpa-onnx-1.10.43/sherpa-onnx-1.10.43/build-riscv64-linux-musl/_deps/onnxruntime-src/lib:" ../../lib/libsherpa-onnx-core.a -Wl,-rpath,$ORIGIN/../lib -Wl,-rpath,$ORIGIN/../../../sherpa_onnx/lib ../../lib/libkaldi-native-fbank-core.a ../../lib/libkaldi-decoder-core.a ../../lib/libsherpa-onnx-kaldifst-core.a...

I follow the instruction on this URL: https://github.com/k2-fsa/sherpa-onnx/pull/1737 but I find it is unable to pronounce a person's name, e.g., lucy, lily, in iOS swift ui version.