Implement Manual VAD Commit via Button for Controlled Speech Processing
I've added a button to my client that is meant to stop VAD (Voice Activity Detection) from committing my speech and sending it to the server as soon as it detects a pause. Instead, the input should be held until I click the button again. My problem is that I can't find a function in the agent that lets me mute or pause VAD, or at least makes it wait until I manually commit the input. Transporting the button click to the agent already works reliably via the 'data_received' event.
In short, VAD should not decide on its own that the user has finished speaking. The agent should only start processing or transmitting the speech once I give the command. Essentially, I need a simple button that manually commits the spoken input. Does anyone have ideas on the best way to approach this? Perhaps the assistant needs to wait for an explicit commit, or something similar?
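To make the behavior I'm after concrete, here is a minimal sketch in plain Python. It does not use any agent or VAD API. `ManualCommitGate`, `push_segment`, `handle_data`, and the "hold" / "commit" message strings are all hypothetical names I made up for illustration, and I'm assuming the button payload arrives as UTF-8 bytes. The idea is that segments finished by VAD are buffered while the gate is held and only forwarded once the commit arrives.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class ManualCommitGate:
    """Hypothetical gate: buffers finished speech segments until a manual commit."""

    on_commit: Callable[[str], None]  # called with the text that should be processed
    held: bool = False
    _segments: List[str] = field(default_factory=list)

    def hold(self) -> None:
        # First button click: stop forwarding segments automatically.
        self.held = True

    def push_segment(self, text: str) -> None:
        # Would be called whenever VAD/STT reports a finished segment.
        if self.held:
            self._segments.append(text)
        else:
            self.on_commit(text)

    def commit(self) -> None:
        # Second button click: release everything buffered as one input.
        self.held = False
        if self._segments:
            joined = " ".join(self._segments)
            self._segments.clear()
            self.on_commit(joined)

    def handle_data(self, payload: bytes) -> None:
        # Assumed wire format: the client sends the literal strings "hold" or "commit".
        message = payload.decode("utf-8")
        if message == "hold":
            self.hold()
        elif message == "commit":
            self.commit()


if __name__ == "__main__":
    gate = ManualCommitGate(on_commit=lambda text: print("committed:", text))
    gate.handle_data(b"hold")
    gate.push_segment("first part")
    gate.push_segment("second part")
    gate.handle_data(b"commit")
```

In my setup, `handle_data` would be called from the existing 'data_received' handler. What I'm missing is the agent-side hook that plays the role of `push_segment` and `on_commit`, i.e. a way to keep the agent from acting on VAD's end-of-speech until I commit.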