faster-whisper Is it possible to add audio context length parameter like in whisper.cpp

Is it possible to add audio context length parameter like in whisper.cpp

Open fquirin opened this issue 1 year ago • 0 comments

I'm working mostly with very small files like 2-6s and what I noticed many times in Whisper is that transcription time is not linear. What I mean is that 3s and 6s files will be processed in almost the same time. I don't know the inner parts of Whisper well enough but I think this is related to the 30s window used to process chunks.

In whisper.cpp there is a parameter called audio_ctx that has been described here: https://github.com/ggerganov/whisper.cpp/issues/137

Do you think it might be possible to implement something similar in faster-whisper so that we can experiment with it and see if it reduces processing time for small files?

Apr 23 '23 18:04 fquirin

faster-whisper faster-whisper copied to clipboard

Is it possible to add audio context length parameter like in whisper.cpp

faster-whisper
faster-whisper copied to clipboard