Can we get the recognized speech original text from the audio?

Open JedLee6 opened this issue 7 months ago • 1 comments

Dear developers. Could we get the recognized speech original text from the audio in the Qwen2.5-Omni" and "Qwen2-Audio" model? So we can help user to verify if the AI assistant recognized what the user saied correctly? Thanks a lot!

May 18 '25 08:05 JedLee6

no. for the model used audio directly for input.did not transfer to text first

May 22 '25 04:05 Juude

Marking as stale. No activity in 60 days.

Jul 28 '25 09:07 github-actions[bot]