Juliusz Chroboczek
Juliusz Chroboczek
I'm open to the idea, but I'd need to speak with the people who actually need the feature. In particular, I'd need to understand why they don't use a system-wide...
Ah-ha, you're thinking of server-side TTS. Yes, that makes more sense. I think this could by done by writing a separate client that connects to the Galene server and does...
I've got a very early prototype. On my laptop, it takes 290% CPU to transcribe a single stream in real time, and on the order of 500MB of RAM. That's...
I've done some more experimenting. The smallest model available "tiny_q5.1", runs in real-time on my laptop, but the quality is not useful (it mostly produces hallucinations). The base.en model almost...
Increasing the segment size to 5s improves the results quite a bit. A better segmentation strategy is needed.
After some more tweaking, it's sort of usable. Please see https://github.com/jech/galene-stt
Comment by @davidzhao on the Pion channel: > it's possible to not request microphone for autoplay, but it's complicated. > The solution we found was to call play() explicitly and...
The change is a net positive, it only adds more data to previously unused fields. On the other hand, I don't think it solves any of the fundamental issues we...
The obvious solution is to add a method `WriteRTX` to `TrackLocalStaticRTP`, which I'm quite willing to do. Any better ideas?
Joe, we're only just getting to grips with v4. If you change everything, you're going to lose a lot of people.