whisper.cpp icon indicating copy to clipboard operation
whisper.cpp copied to clipboard

unexpected output with word level timestamps & dual channel (stereo)

Open eschmidbauer opened this issue 1 year ago • 2 comments

Hi, thank you for recently adding stereo (dual channel) support. It works great with utterance level timestamps but when i use: -ml 1 to do word level timestamps, I get some unexpected output.

Please see the attached gist for output

Here is the wav file i used (i used ffmpeg to convert it to 16k ffmpeg -i commercial_stereo.wav -ar 16000 commercial_stereo_16k.wav)

eschmidbauer avatar Nov 28 '22 16:11 eschmidbauer