Mark Backman

Results 240 comments of Mark Backman

@krishvadhani19 does this require a Business Plan ($699/mo) to test? From the [docs](https://docs.app.resemble.ai/docs/text_to_speech/streaming_websocket), I see: > Note: Websocket API is only available for Business plan users. If you're running into...

@krishvadhani19 I just removed your message as it contained a key. You might want to rotate it. If you want to share a key, that would be great, but it...

@krishvadhani19 Sorry for so many comments! There's a lot to take into account for building a TTS service. One more question: do you have an idle timeout (e.g. disconnect the...

Please provide a repro case, including a concise code example and steps to reproduce. As far as we know and our testing shows, this is not possible. The bot's output...

Very cool! Can you add an example, so this is easy to test?

This seems like a helpful change from a debugging standpoint. @jjmaldonis can you add a short changelog entry for this? Also, please lint the code (scripts/fix-ruff.sh) or install the pre-commit...

`USE_ONLY_LAST_VAD_SEGMENT` means the last user turn (e.g. audio between `VADUserStartedSpeakingFrame` and`VADUserStoppedSpeakingFrame`). This ensure that the latest sample is provided to the smart-turn model. How the speech is segmented depends on...

Right. The idea is that the smart-turn model was trained on audio segments of 2-8 sec (IIRC) and the content of the audio is irrelevant. It's the audio frequency/pattern itself...

We're refactoring how interruptions work. Once that work lands, then I would expect DeepgramFlux to fit better into the new paradigm, making service switching viable. The interruptions refactor is happening...

Yes, this is already supported using the Pipecat client SDKs. You can check out this example: https://github.com/pipecat-ai/pipecat-examples/tree/main/websocket Using the WebSocketTransport, the client and server can communicate the RTVI events, which...