whisperX
whisperX copied to clipboard
Streaming with whisperx
Is there a repo or code that allows for real-time streaming with whisperx? Thank you!
Whisper itself can't stream, so I dont think so, unfortunately
This issue from whisper.cpp comes to mind Support for realtime audio input . It highlights some issues with doing realtime transcription with whisper in general.
Would be great to add support for streaming, because folks have been using Whisper.cpp successfully and implemented streaming - for example gladia.io.
This submission will allow for segmented streaming using Whisperx, improving client response time, but not for underlying streaming implementations.
Pull request here
So this can now be used to transcribe live from an audiostream like mic input? If yes, do you maybe know how?
any update?
Whisper itself can't stream, so I dont think so, unfortunately
I assume it is possible, sort of, because whisperx splits the audio to chunks, it can proccess each chunk individually and stream it after finished, instead of waiting for all chunks.
would be nice feature.