agents
agents copied to clipboard
Build real-time multimodal AI applications 🤖🎙️📹
If you said agent.say(), but your before_tts_cb returned an AsyncIterable, it'd throw - because the transcript was a string but the before_tts_cb was AsyncIterable.
For example, voice conversion
I have encountered an issue with the voice assistant when synthesizing Chinese text. The time interval between LLM and synthesized speech outputs is noticeably longer when the output is in...
I would like to consult about the best practices for deploying services, particularly in scenarios involving stateful agent-services. Currently, I have a program running, and I have added some new...
This pull request fixes the SSML implementation in the Azure Text-to-Speech (TTS) integration by correctly adding the tag. This ensures that the SSML is properly structured and the specified voice...
This PR was opened by the [Changesets release](https://github.com/changesets/action) GitHub action. When you're ready to do a release, you can merge this and the packages will be published to npm automatically....
The worker will exit due to websocket abnormal disconnection, causing the chat to be interrupted abnormally. Automatic reconnect OpenAI websocket after disconnection.
I'm looking for an elegant way to save the audio output of the Text-to-Speech (TTS) system, specifically the audio that is played back to users during interactions, as well as...
I've gotten tripped up by this a bunch.
When I use OpenAI TTS there is no problem, but when I use ElevenLabs TTS, it says the first message but then, I get this message in the terminal: {"message":...