sherpa-onnx icon indicating copy to clipboard operation
sherpa-onnx copied to clipboard

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android...

Results 419 sherpa-onnx issues
Sort by recently updated
recently updated
newest added

Hi @csukuangfj , I have refactored the previous implementation, Below code base is functional tested and cleaned up I think you can suggest more for naming for more clarity or...

I need assistance with understanding why my mic cant detect any keywords : this is how i set it up , howvever everytime i try to print the keyword it...

连字符(hyphen)​​ (Unicode: U+2010,键盘直接输入减号键 -) 如: short-term risks with long-term rewards. 连字符(hyphen -)用于连接单词或音节,形成复合词或拆分换行,朗读时需​​连贯无停顿 破折号(Em Dash — 或 ——)​​通常需要明显停顿 目前的情况是:在朗读英文文本时, 对连字符有明显停顿,对破折号没有停顿,这和正确读法刚好相反 (sherpa-onnx-non-streaming-tts-x64-v1.11.3.exe) ![Image](https://github.com/user-attachments/assets/a8f244de-70b9-42a8-b478-c1db315e8f86)

Dear sherpa-onnx Developers, I am a Flutter developer and I am very interested in integrating the powerful speech processing capabilities of sherpa-onnx, particularly Voice Activity Detection (VAD) and speaker recognition,...

![Image](https://github.com/user-attachments/assets/bd063ee3-3479-431f-8c0d-326c5a0f08c9) 我想使用[sherpa-onnx-streaming-paraformer-bilingual-zh-en.tar.bz2](https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/sherpa-onnx-streaming-paraformer-bilingual-zh-en.tar.bz2)的模型,但沒有範例中的joiner ![Image](https://github.com/user-attachments/assets/f4e9a99f-d6b4-431c-a975-409af1653fec) 請問該怎麼辦?

After seeing this merged PR (https://github.com/k2-fsa/sherpa-onnx/pull/1820) I thought pauses could be controlled. But seems not likely. In that PR there is a new config parameter named "silence_scale", but it seems...

我看在输入框有 Input your keywords here, one keyword per line.\nTwo example keywords are given below:\n\nn ǐ h ǎo @你好\nd àn g ē d àn g ē @蛋哥蛋哥 ,不是很理解 自定义关键字需要 添加 n...

I'm testing speech recognition from a microphone with endpoint detection using the provided Python [example](https://github.com/k2-fsa/sherpa-onnx/blob/master/python-api-examples/speech-recognition-from-microphone-with-endpoint-detection.py), and I've found the following issue: when the first token of a segment is the...

## 问题 目前的算法是如果识别到1.5秒静默帧或者识别到唤醒词才会reset清除之前识别出来的token,但是如果有一个唤醒词没有识别出来,接下来相似的唤醒词就有很大几率无法识别。 ## 建议 每次beam search仅搜索当前帧和之前一定时间内(如2秒)的结果

使用的部分应该有两个, 但是这个地方工程配置上看看需不需要补充一下 ![Image](https://github.com/user-attachments/assets/420ce7f8-bcd9-4161-a40a-f8069d957988) 我看没有对应的文档,但是对应的ncnn的有一个文档 https://k2-fsa.github.io/sherpa/ncnn/ios/for-the-more-curious-swift.html 看看这部分是不是对onnx的部分可以补充一下