sherpa-onnx issues

Provision to include Fastspeech2 acoustic models

1

Hi..Thanks for the amazing work.. I am mainly following the TTS framework..The Matcha+Vocos pipeline gives excellent inference time with decent synthesized audios (tested with LJSpeech).. Is it possible to include...

anirpipi

add phoonnx models

5

hello I am working in my own TTS engine https://github.com/TigreGotico/phoonnx when using the espeak phonemizer the models are compatible with piper TTS, in fact you already are using my models...

JarbasAl

Why parakeet (transducer model) can't support hotwords?

9

Why parakeet can't support hotwords? I check code in offline-recognizer-transducer-nemo-impl.h `class OfflineRecognizerTransducerNeMoImpl : public OfflineRecognizerImpl { public: explicit OfflineRecognizerTransducerNeMoImpl( const OfflineRecognizerConfig &config) : OfflineRecognizerImpl(config), config_(config), symbol_table_(config_.model_config.tokens), model_(std::make_unique( config_.model_config)) { if...

paulxin001

How to reduce onnxruntime.xcframework size for iOS

1

Hi, thanks for the great framework! I'm trying to use a VITS TTS model on my iOS app, but the `onnxruntime.xcframework` is quite large (around 64.6 MB, v1.17.1). This is...

kim-sangyeon

flet example

9

Hi here, I hope we have a flet example https://flet.dev/ so that with python code to build app, desktop, web... in once for all. thanks.

SamYuan1990

ubuntu server上有哪些ASR、TTS模型是支持流式

3

因为我们需要一个实时对话，本地部署的。

manhongnie

Empty transcription with "parakeet-tdt-0.6b-v2" on some Files

6

Hi, First of all, thank you very much for adding all the latest models to sherpa-onnx. However, it seems there might be an issue with the implementation of the recently...

ahazned

[Flutter] Hot release doesn't free up the RAM taken by the model

2

In the development, I start the `Isolate.spawn` with a `kokoro-en-v0_19` model. After hot restarting the app, it doesn't free the memory used in the previous VM instance. It is reproducable...

esuljic

请问当前TTS模型如何实现流式输出功能？

2

Thanks for your great job! 我现在的安卓设备通过离线接口调用LLM的API，问答输出内容是流式stream的。为了用户体验希望将回答结果以流式方式TTS播放出来，请问[https://github.com/k2-fsa/sherpa-onnx/tree/master/android/SherpaOnnxTtsEngine](url) 支持流式播放TTS结果吗？效果类似于端到端语音大模型，我输入语音，大模型同步播放回答内容的音频。

Yfredy

请问当前是否支持HarmonyOS上的kws功能？

3

当前的最新版本是HarmonyOS 6.0.0(20)

guoshaoxinabc

sherpa-onnx
sherpa-onnx copied to clipboard

Metadata

Provision to include Fastspeech2 acoustic models

add phoonnx models

Why parakeet (transducer model) can't support hotwords?

How to reduce onnxruntime.xcframework size for iOS

flet example

ubuntu server上有哪些ASR、TTS模型是支持流式

Empty transcription with "parakeet-tdt-0.6b-v2" on some Files

[Flutter] Hot release doesn't free up the RAM taken by the model

请问当前TTS模型如何实现流式输出功能？

请问当前是否支持HarmonyOS上的kws功能？

← Metadata

Owner

Metadata

sherpa-onnx sherpa-onnx copied to clipboard

Metadata

← Metadata

Owner

Metadata

sherpa-onnx
sherpa-onnx copied to clipboard