cookbook
cookbook copied to clipboard
Audio Output Issues in Multimodal Live API
Description of the bug:
I am using the following code:
https://github.com/google-gemini/cookbook/blob/main/gemini-2/live_api_starter.py
This code was working fine 4 days ago, but today I’m encountering the following issues in the output:
- Words are frequently cut off or missing.
- Sentences and words are jumbled, making the audio difficult to understand.
- There is significant repetition of words and sentences.
I am attaching a file where I asked Gemini to provide information about Manchester United. The attached link contains the audio file I get in the streamed output.
https://drive.google.com/file/d/1ejDX9wlxt6wd0qM3-zDLTGh6g0GAAryv/view?usp=sharing
Actual vs expected behavior:
Expected Behavior: The audio output should be smooth, clear, and easily understandable.
Actual Behavior: While the audio output was fine 4 days ago, the following discrepancies were observed today:
- Words are frequently cut off or missing.
- Sentences and words are jumbled.
- Significant repetition of words and sentences reduces clarity and usability.
Any other information you'd like to share?
No response