Mark Backman comments

Results 240 comments of


                                            Mark Backman

Meeting translation pipeline suggestion

The trickiest part of the problem is detecting which language is being spoken in the first place. The best solution would be to select an STT solution that can transcribe...

Meeting translation pipeline suggestion

One pipeline should suffice. You could also use a multimodal LLM like Gemini Live for this to take audio in and translate. I would imagine it will be up for...

Meeting translation pipeline suggestion

Yes! GladiaSTTService has a really nice implementation that makes translation easy. In fact, Pipecat now yields a TranslationFrame in addition to the TranscriptionFrame when enabling this for their service. Check...

Nova Sonic breaks on tool cancellation

It looks like you're mixing ways of setting tools. You need to either use the constructor or use the context. The current best practice is to use the LLMContext, which...

Heygen LiveAvatar Integration

In talking with the HeyGen team, we'll work to migrate to LiveAvatars. They're not ready for migration yet, but this will happen in the near future.

Performance Issue: Initial Greeting Latency in Telephony Agent Pipeline

A few questions: - Are you running locally or deployed? - Is this a dial-in or dial-out use case? (I'm assuming that you're dialing in to the bot based on...

Audio Mixer 100% CPU Spike On Twilio Client Disconnected

Mind submitting this in PR format so we can see the diff?

modify Gemini Live transcript if server_content contains thought=true

Can you explain what you're looking to accomplish?

`SmallWebRTCConnection` with Docker

@filipi87 maybe you have an idea?

`SmallWebRTCConnection` with Docker

We're going to be doing some work in the next few weeks to add SmallWebRTC to Pipecat Cloud. In doing so, we'll be running in a Docker container. We'll share...