Mark Backman comments

Results 240 comments of


                                            Mark Backman

Resemble's Websocket TTS integration with Pipecat

@krishvadhani19 does this require a Business Plan ($699/mo) to test? From the [docs](https://docs.app.resemble.ai/docs/text_to_speech/streaming_websocket), I see: > Note: Websocket API is only available for Business plan users. If you're running into...

Resemble's Websocket TTS integration with Pipecat

@krishvadhani19 I just removed your message as it contained a key. You might want to rotate it. If you want to share a key, that would be great, but it...

Resemble's Websocket TTS integration with Pipecat

@krishvadhani19 Sorry for so many comments! There's a lot to take into account for building a TTS service. One more question: do you have an idle timeout (e.g. disconnect the...

Bot Stops speaking mid-sentence

Please provide a repro case, including a concise code example and steps to reproduce. As far as we know and our testing shows, this is not possible. The bot's output...

Add File API to GeminiMultimodalLive

Very cool! Can you add an example, so this is easy to test?

deepgram: added request IDs to debug logs

This seems like a helpful change from a debugging standpoint. @jjmaldonis can you add a short changelog entry for this? Also, please lint the code (scripts/fix-ruff.sh) or install the pre-commit...

Why does Smart Turn Analyzer Have USE_ONLY_LAST_VAD_SEGMENT=True

`USE_ONLY_LAST_VAD_SEGMENT` means the last user turn (e.g. audio between `VADUserStartedSpeakingFrame` and`VADUserStoppedSpeakingFrame`). This ensure that the latest sample is provided to the smart-turn model. How the speech is segmented depends on...

Why does Smart Turn Analyzer Have USE_ONLY_LAST_VAD_SEGMENT=True

Right. The idea is that the smart-turn model was trained on audio segments of 2-8 sec (IIRC) and the content of the audio is irrelevant. It's the audio frequency/pattern itself...

Get Deepgram Flux working with ServiceSwitcher

We're refactoring how interruptions work. Once that work lands, then I would expect DeepgramFlux to fit better into the new paradigm, making service switching viable. The interruptions refactor is happening...

Interruption handling over FastAPIWebsocket not working when Bot speaking

Yes, this is already supported using the Pipecat client SDKs. You can check out this example: https://github.com/pipecat-ai/pipecat-examples/tree/main/websocket Using the WebSocketTransport, the client and server can communicate the RTVI events, which...