Whisper-WebUI
Whisper-WebUI copied to clipboard
Timestamps always maximum length when using Silero VAD
Transcription appears to be accurate, however the ending timestamps for each line are always set at the beginning timestamp of the next line, resulting in subtitles constantly displayed long after speech ends, e.g.:
37
00:09:44,419 --> 00:09:56,950
I can't solve the problem at this rate.
38
00:09:56,950 --> 00:10:07,269
What should I do?
39
00:10:07,269 --> 00:10:11,269
I'll take my time and look at it.
``