[FEATURE] Add the SenseVoice multilingual speech understanding model

Open guihaoqun opened this issue 9 months ago • 2 comments

Description

I would like the SenseVoice speech recognition model to be added as an extension. The existing TTS extensions in the TEN framework sound too mechanical and lack emotion. SenseVoice performs better in this regard, so I recommend adding it.

Severity

Critical

Additional Information

https://github.com/FunAudioLLM/SenseVoice

SenseVoice is a speech foundation model with multiple speech understanding capabilities, including automatic speech recognition (ASR), spoken language identification (LID), speech emotion recognition (SER), and audio event detection (AED).
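
To make the four capabilities concrete, here is a minimal sketch. It assumes SenseVoice's raw output prefixes the transcript with tags of the form `<|language|><|EMOTION|><|event|><|itn flag|>`, as the examples in its README suggest. The tag layout, the sample string, and the names `RichTranscript` and `parse_rich_transcript` are assumptions for illustration only; they are not part of the TEN framework or of SenseVoice's API.

```python
import re
from dataclasses import dataclass

# Assumed tag layout: <|lang|><|EMOTION|><|event|><|itn flag|>transcript
TAG = re.compile(r"<\|([^|]*)\|>")


@dataclass
class RichTranscript:
    language: str  # spoken language identification (LID)
    emotion: str   # speech emotion recognition (SER)
    event: str     # audio event detection (AED)
    text: str      # automatic speech recognition (ASR) transcript


def parse_rich_transcript(raw: str) -> RichTranscript:
    """Split an assumed SenseVoice-style tagged string into its fields."""
    tags = TAG.findall(raw)
    text = TAG.sub("", raw).strip()
    # Pad with empty strings so untagged input still parses.
    language, emotion, event = (tags + ["", "", ""])[:3]
    return RichTranscript(language, emotion, event, text)


# Hypothetical sample output, not taken from a real model run.
sample = "<|en|><|HAPPY|><|Speech|><|withitn|>Nice to meet you."
result = parse_rich_transcript(sample)
print(result)
```

An extension built this way could forward `result.text` as the recognized text and pass `result.emotion` along as extra metadata, which seems to be the part of SenseVoice this request is really about.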

guihaoqun • Mar 27 '25 07:03

Same here.

seetimee • Jun 24 '25 07:06

SenseVoice would be an STT extension, not a TTS extension.

Are you referring to integration with the commercial SenseVoice API or with the local SenseVoice model?

AI-J-IN • Nov 18 '25 05:11