whisper.cpp icon indicating copy to clipboard operation
whisper.cpp copied to clipboard

Timestamps skip leading silence

Open luke-jr opened this issue 2 years ago • 1 comments

For some reason, transcripts always start at 0s, even when there's leading silence.

P.S. It might be nicer to just handle multiple audio channels with overlapping speech gracefully ;)

luke-jr avatar Dec 24 '22 03:12 luke-jr

It's a limitation of the model

https://github.com/openai/whisper/discussions/375

ggerganov avatar Dec 29 '22 11:12 ggerganov