whisper.cpp
Timestamps skip leading silence
For some reason, transcript timestamps always start at 0s, even when the audio begins with silence.
P.S. It might be nicer to just handle multiple audio channels with overlapping speech gracefully ;)
It's a limitation of the model
https://github.com/openai/whisper/discussions/375
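If you need timestamps that account for the leading silence, one possible workaround is to measure the silence yourself, trim it before transcription, and shift the resulting timestamps by that offset. The sketch below is not part of whisper.cpp or Whisper: the function names, the `(start_s, end_s, text)` segment layout, and the amplitude threshold of 0.01 are all assumptions for illustration, and a fixed threshold will likely misfire on noisy recordings.

```python
from typing import List, Sequence, Tuple

# Hypothetical segment layout: (start_seconds, end_seconds, text).
Segment = Tuple[float, float, str]


def leading_silence_samples(samples: Sequence[float], threshold: float = 0.01) -> int:
    """Return the number of samples before the first one louder than `threshold`.

    Assumes mono float samples in [-1.0, 1.0]. Returns len(samples) if the
    whole clip stays at or below the threshold.
    """
    for i, s in enumerate(samples):
        if abs(s) > threshold:
            return i
    return len(samples)


def shift_segments(segments: List[Segment], offset_s: float) -> List[Segment]:
    """Add `offset_s` seconds to the start and end of every segment."""
    return [(start + offset_s, end + offset_s, text) for start, end, text in segments]


if __name__ == "__main__":
    sample_rate = 16000  # assumed sample rate in Hz
    audio = [0.0] * (2 * sample_rate) + [0.5] * sample_rate  # 2 s silence, 1 s signal

    n = leading_silence_samples(audio)
    offset_s = n / sample_rate
    trimmed = audio[n:]  # this is what you would pass to the transcriber

    # Stand-in for whatever the transcriber returns on the trimmed audio.
    segments = [(0.0, 1.0, "hello")]
    print(shift_segments(segments, offset_s))
```

Trimming first and shifting afterwards keeps the model's own timestamps untouched, so the only thing you are trusting is the silence detector.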