Mark Backman
Mark Backman
Hi @Kharacternyk, the `AudioContextTTSService` class handles a websocket case where there can be multiple streams returned at the same time. This is unique to only Websocket services. If your service...
Got it. Yes, the base classes for the TTS services are complex. There are so many permutations of how services are built. Tagging @aconchillo for thoughts on how to proceed.
We're working on this. We've added a few items already: - RealtimeInputConfig - ContextWindowCompression - MediaResolution We'll add more over time.
> Can you also add Language params ? Flash 2.0 001 supports various language codes that introduce subtle accents which are great for deploying regional voice agents `language` is already...
It sounds like your custom FrameProcessor(s) are blocking the StartFrame. Make sure that your FrameProcessors push all frames. Read more about it here: https://docs.pipecat.ai/guides/fundamentals/custom-frame-processor Note: The StartFrame must traverse the...
Filipi just spent time fixing issues in Flux here: https://github.com/pipecat-ai/pipecat/pull/3144 Closing.
Thanks for the detailed report. 🙇 Tagging @filipi87 to take a look when he gets a moment.
Hey @krishvadhani19 I just set up an example with this and unfortunately, it doesn't run. Does this work for you? Also, in comparing to the docs, I see a sample...
For us to accept a submission, it would have to mimic the other TTS services. A good one to look at would be [CartesiaTTSService](https://github.com/pipecat-ai/pipecat/blob/main/src/pipecat/services/cartesia/tts.py#L75). The only difference between Resemble.ai and...
@krishvadhani19 can you reply, otherwise I'll close this out due to inactivity.