Mark Backman
Mark Backman
The trickiest part of the problem is detecting which language is being spoken in the first place. The best solution would be to select an STT solution that can transcribe...
One pipeline should suffice. You could also use a multimodal LLM like Gemini Live for this to take audio in and translate. I would imagine it will be up for...
Yes! GladiaSTTService has a really nice implementation that makes translation easy. In fact, Pipecat now yields a TranslationFrame in addition to the TranscriptionFrame when enabling this for their service. Check...
It looks like you're mixing ways of setting tools. You need to either use the constructor or use the context. The current best practice is to use the LLMContext, which...
In talking with the HeyGen team, we'll work to migrate to LiveAvatars. They're not ready for migration yet, but this will happen in the near future.
A few questions: - Are you running locally or deployed? - Is this a dial-in or dial-out use case? (I'm assuming that you're dialing in to the bot based on...
Mind submitting this in PR format so we can see the diff?
Can you explain what you're looking to accomplish?
@filipi87 maybe you have an idea?
We're going to be doing some work in the next few weeks to add SmallWebRTC to Pipecat Cloud. In doing so, we'll be running in a Docker container. We'll share...