RealtimeSTT icon indicating copy to clipboard operation
RealtimeSTT copied to clipboard

Transcription isn't triggered in absence of frames

Open axtgr opened this issue 8 months ago • 2 comments

I'm using the websocket server with a remote TypeScript client. The client uses its own simplistic VAD filter to avoid sending frames with no voice. Because of this filter, final transcription isn't done as there is simply no frame to trigger it. Removing the client filter solves the issue, but it's less than ideal because I'd like to avoid continuously sending frames over the wire. Perhaps a timeout could be added to the recorder that triggers the time checks in case no frames have been received?

axtgr avatar Jun 19 '25 14:06 axtgr

Maybe, yes. Unsure if this add much to RealtimeSTT. You could just send silence in this case.

KoljaB avatar Jun 19 '25 20:06 KoljaB

Sure, but this is somewhat hacky and unexpected. I spent quite some time figuring out why transcription isn't triggering on silence only to realize that I'm too silent. I think this at least warrants a mention in the readme.

axtgr avatar Jul 02 '25 08:07 axtgr