sherpa-onnx issues

speaker-identification-with-vad-non-streaming-asr.py not support the sense_voice ASR model.

![Image](https://github.com/user-attachments/assets/f4973ed0-3abf-44fa-a8ab-581d7e951695) Lack of support for sense_voice.

VisminLab

Is it support word-level timestamps?

Hi I want to know is it support word-level timestamps for text to speech? Thanks

amira133

能否在不初始化的情况下获取模型的speakers呢？

1

我看到huggingface的TTS演示案例中，”select a model”的下拉选项中标注了Speakers的数量，我想用C#复现一个这种列表，但是依次初始化每个模型再获取NumSpeakers的效率太低了。我通过FileStream只读取模型文件最后1K的字节，再匹配“n_speakers”前缀和“r”后缀截取Speakers数量，目前来看是似乎是能做到快速读取到每个模型的Speakers数量，但是我不知道是否有必要这么做，也不知道这么做能不能适配每个模型

406832098

Add cache mechanism to sherpa tts

23

Delay is a major drawback of sherpa tts on android phones, which makes it unusable for average blind people using average phones. By caching the most frequent tts requests, sherpa...

mah92

Exception Thrown When Debugging C# Windows x86 Application in Visual Studio Due to Calling Convention Mismatch (stdcall vs cdecl)

2

Exception: Managed Debugging Assistant 'PInvokeStackImbalance' Message=Managed Debugging Assistant 'PInvokeStackImbalance' : 'A call to PInvoke function 'sherpa-onnx!SherpaOnnx.KeywordSpotter::SherpaOnnxCreateKeywordSpotter' has unbalanced the stack. This is likely because the managed PInvoke signature does not...

ajon88

期待支持tts的朗读跟随能力。

9

实时的朗读跟随算是小说阅读app的基本功能之一，期望作者大大支持。

SusionSuc

C# Objects that expect to be Disposed do not implement IDisposable

Hello In the implementation for objects like OfflineTtsGeneratedAudio, I notice that there is a Dispose method but the class does not implement the IDisposable interface. This means we must manually...

jacob-mink-1996

riscv64-unknown-linux-musl 交叉编译报错libonnxruntime.so: undefined reference xxx

11

你好，我使用sherpa-onnx-1.10.43的代码，编译riscv64平台时遇到以下问题 /usr/bin/cmake: /usr/local/lib/libcurl.so.4: no version information available (required by /usr/bin/cmake) /opt/toolschain/zam70/riscv64-linux-musl-x86_64/bin/riscv64-unknown-linux-musl-g++ -Wl,-rpath='/opt/toolschain/zam70/riscv64-linux-musl-x86_64/sysroot/lib' -mcpu=c906fdv -march=rv64imafdcv0p7xthead -mcmodel=medany -mabi=lp64d -O3 -DNDEBUG -flto -fno-fat-lto-objects CMakeFiles/sherpa-onnx.dir/sherpa-onnx.cc.o -o ../../bin/sherpa-onnx -Wl,-rpath,"\$ORIGIN:/home/nongbojian/workcode/numbers/kokoro/sherpa-onnx-1.10.43/sherpa-onnx-1.10.43/build-riscv64-linux-musl/_deps/onnxruntime-src/lib:" ../../lib/libsherpa-onnx-core.a -Wl,-rpath,$ORIGIN/../lib -Wl,-rpath,$ORIGIN/../../../sherpa_onnx/lib ../../lib/libkaldi-native-fbank-core.a ../../lib/libkaldi-decoder-core.a ../../lib/libsherpa-onnx-kaldifst-core.a...

bjNong

希望kokoro tts能支持日语，或者提供自己制作日语支持的说明

3

XDesktopSoft

unable pronounce a person's name, e.g., lucy, in iOS swift ui version

3

I follow the instruction on this URL: https://github.com/k2-fsa/sherpa-onnx/pull/1737 but I find it is unable to pronounce a person's name, e.g., lucy, lily, in iOS swift ui version.

tedShadow

sherpa-onnx
sherpa-onnx copied to clipboard

Metadata

speaker-identification-with-vad-non-streaming-asr.py not support the sense_voice ASR model.

Is it support word-level timestamps?

能否在不初始化的情况下获取模型的speakers呢？

Add cache mechanism to sherpa tts

Exception Thrown When Debugging C# Windows x86 Application in Visual Studio Due to Calling Convention Mismatch (stdcall vs cdecl)

期待支持tts的朗读跟随能力。

C# Objects that expect to be Disposed do not implement IDisposable

riscv64-unknown-linux-musl 交叉编译报错libonnxruntime.so: undefined reference xxx

希望kokoro tts能支持日语，或者提供自己制作日语支持的说明

unable pronounce a person's name, e.g., lucy, in iOS swift ui version

← Metadata

Owner

Metadata

sherpa-onnx sherpa-onnx copied to clipboard

Metadata

← Metadata

Owner

Metadata

sherpa-onnx
sherpa-onnx copied to clipboard