sherpa-onnx issues

ONNX inference optimization

Hello, development team! At the moment, I’m experimenting with giga-rnnt-v2, focusing on parallel inference of the model. What has been done so far: 0. The model sherpa-onnx-nemo-transducer-giga-am-v2-russian-2025-04-19.tar.bz2 was downloaded from...

Qeshtir

Add LODR support to online and offline recognizers

9

This PR adds LODR support from Icefall to offline and online recognizers for both LM shallow fusion and LM rescore. (see https://k2-fsa.github.io/icefall/decoding-with-langugage-models/LODR.html) Usage example: ``` # offline LM rescore sherpa-onnx-offline...

vsd-vector

support canary-180m-flash

1

hello The canary-180m is ranked 9th in the https://huggingface.co/spaces/hf-audio/open_asr_leaderboard ASR ranking. With a size of only 700MB and very good transcription accuracy, it is very suitable for local transcription. Is...

jc955

解决windows编译isspace函数调用失败问题

1

GlocKieHuan

ASR English model cannot recognize my voice well

14

why all the english related model in asr cannot recognize well? though Chinese ones seem work well. Have you tested the english related models? Am i missing anything to make...

jims57

支持将sherpa-onnx转为rknn格式在npu上运行吗

11

大佬，有提供教程吗

lqx-all

Support for Arabic diacritization in Arabic TTS

4

Hi In the Arabic context, short vowels are not written directly. So e-speak is not capable of reading correctly. There has been a great amount of research for models inheriting...

mah92

使用cuda进行tts播放语音时提示no kernel image is available for execution on the device，并且出现段错误

6

您好，我在使用sherpa-onnx-offline-tts-play进行tts推理时出现，能否帮忙看看是什么原因？谢谢。下面是运行的打印信息： ./sherpa-onnx-offline-tts-play --vits-model=./vits-melo-tts-zh_en/model.onnx --vits-lexicon=./vits-melo-tts-zh_en/lexicon.txt --vits-tokens=./vits-melo-tts-zh_en/tokens.txt --tts-rule-fsts="./vits-melo-tts-zh_en/date.fst,./vits-melo-tts-zh_en/number.fst" --vits-dict-dir=./vits-melo-tts-zh_en/dict --provider="cuda" --output-filename=../zh-en-2.wav "测试tts样例，1000识别，" /home/ubuntu/work/git_program/deeplearning/text-to-speech/sherpa-onnx/sherpa-onnx/csrc/parse-options.cc:Read:375 .//sherpa-onnx-offline-tts-play --vits-model=./vits-melo-tts-zh_en/model.onnx --vits-lexicon=./vits-melo-tts-zh_en/lexicon.txt --vits-tokens=./vits-melo-tts-zh_en/tokens.txt --tts-rule-fsts=./vits-melo-tts-zh_en/date.fst,./vits-melo-tts-zh_en/number.fst --vits-dict-dir=./vits-melo-tts-zh_en/dict --provider=cuda --output-filename=../zh-en-2.wav '测试tts样例，1000识别，' ALSA lib pcm.c:2495:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.rear ALSA lib pcm.c:2495:(snd_pcm_open_noupdate)...

guitLearn

热词和关键字识别支持RK板子得RKNN模型吗

1

热词和关键字识别支持RK板子的RKNN模型吗，因为想用到RK的NPU

xiaohuihuige

TTS vits-icefall-zh-aishell3 sid=66 模型的发声

6

网页上rule-66.wav的发声很清楚。我编译后用相同的指令得到的声音很奇怪。网页上rule-66.wav的发声是怎么实现的

fangliangs

sherpa-onnx
sherpa-onnx copied to clipboard

Metadata

ONNX inference optimization

Add LODR support to online and offline recognizers

support canary-180m-flash

解决windows编译isspace函数调用失败问题

ASR English model cannot recognize my voice well

支持将sherpa-onnx转为rknn格式在npu上运行吗

Support for Arabic diacritization in Arabic TTS

使用cuda进行tts播放语音时提示no kernel image is available for execution on the device，并且出现段错误

热词和关键字识别支持RK板子得RKNN模型吗

TTS vits-icefall-zh-aishell3 sid=66 模型的发声

← Metadata

Owner

Metadata

sherpa-onnx sherpa-onnx copied to clipboard

Metadata

← Metadata

Owner

Metadata

sherpa-onnx
sherpa-onnx copied to clipboard