MNN icon indicating copy to clipboard operation
MNN copied to clipboard

Can we get the recognized speech original text from the audio?

Open JedLee6 opened this issue 7 months ago • 1 comments

Dear developers. Could we get the recognized speech original text from the audio in the Qwen2.5-Omni" and "Qwen2-Audio" model? So we can help user to verify if the AI assistant recognized what the user saied correctly? Thanks a lot!

Image

JedLee6 avatar May 18 '25 08:05 JedLee6

no. for the model used audio directly for input.did not transfer to text first

Juude avatar May 22 '25 04:05 Juude

Marking as stale. No activity in 60 days.

github-actions[bot] avatar Jul 28 '25 09:07 github-actions[bot]