sherpa-onnx
sherpa-onnx copied to clipboard
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android...
我尝试所了有模型,遇到符号 暂停朗读 太久 一点都不流畅. - 忽略 所有 中文 英文 符号来解决这个问题. 你可以试试下面文本 ``` 使*用新。一代,卡尔,迪的语.音合?成!引擎 ```
https://github.com/k2-fsa/sherpa-onnx/blob/ecc653871d305c79002d2630c7cf0d0e1d6bf1ed/sherpa-onnx/csrc/online-recognizer-transducer-impl.h#L382C1-L383C53 @csukuangfj We have observed an issue since version `1.10.0` of sherpa-onnx. Specifically, the encoder state reset functionality was commented out in the following file: Path: `sherpa-onnx/sherpa-onnx/csrc/online-recognizer-transducer-impl.h` ``` // reset...
Hi, I need the IOS arm64 simulator C API build for use with [`sherpa-rs`](https://github.com/thewh1teagle/sherpa-rs) on macOS M1. Update: Maybe I was wrong and it's universal library. Unfortunately I have issues...
https://github.com/k2-fsa/sherpa-onnx/pull/1732 Delay is a major drawback of sherpa tts on android phones, which makes it unusable for average blind people using average phones. By caching the most frequent tts requests,...
I want to know what format and what type of language model (lm) we can use for the online and offline recognizer, I am using a transducer model. I want...
In the Sherpa-ONNX VITS-based TTS pipeline, we want to convert the input text into phonemes beforehand and pass the phoneme sequence directly to the ONNX model, instead of passing the...
Using the python-api-example ```bash python3 ./python-api-examples/offline-zeroshot-tts.py \ --zipvoice-flow-matching-model sherpa-onnx-zipvoice-distill-zh-en-emilia/fm_decoder.onnx \ --zipvoice-text-model sherpa-onnx-zipvoice-distill-zh-en-emilia/text_encoder.onnx \ --zipvoice-data-dir sherpa-onnx-zipvoice-distill-zh-en-emilia/espeak-ng-data \ --zipvoice-pinyin-dict sherpa-onnx-zipvoice-distill-zh-en-emilia/pinyin.raw \ --zipvoice-tokens sherpa-onnx-zipvoice-distill-zh-en-emilia/tokens.txt \ --zipvoice-vocoder sherpa-onnx-zipvoice-distill-zh-en-emilia/vocos_24khz.onnx \ --prompt-audio sherpa-onnx-zipvoice-distill-zh-en-emilia/prompt.wav \ --zipvoice-num-steps 4...
使用matcha_tts_zh_en_20251010中英混合模型。播报英文的时候,无法播报。日志过滤提示如下:
编写flutter安卓在3566的板子上运行,需要利用板子的NPU,我对这块不是很了解但是文档都翻了一下尝试过自己转RKNN,各种报错无法转出。 https://github.com/k2-fsa/sherpa-onnx/releases/download/kws-models/sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01.tar.bz2 这个是从仓库里面找到的KWS的onxx的模型,安卓环境so文件都配置好了,我已经卡了好几天了实在没办法求助一下
ref https://github.com/k2-fsa/sherpa-onnx/blob/master/flutter-examples/streaming_asr/lib/online_model.dart#L6 or https://github.com/k2-fsa/sherpa-onnx/blob/master/flutter-examples/non_streaming_vad_asr/lib/offline_model.dart#L7 in fact we have implemented Strategy Pattern as line above, but to provide more convenience when using sdk or allow end user to select model by...