whisper.cpp icon indicating copy to clipboard operation
whisper.cpp copied to clipboard

Different platforms return different results

Open WYC-2020 opened this issue 1 year ago • 3 comments

Use the large-v3 model to extract subtitles from the same audio file. Subtitles will be lost under Ubuntu.

./main -m ggml-large-v3.bin -f 1.wav -l zh -osrt 1

In the picture below, Windows is on the left and Ubuntu is on the right.

img_v3_02b3_a6884852-4756-4662-80c4-0fa78cf1ef4g test-audio.zip

WYC-2020 avatar May 21 '24 08:05 WYC-2020

Also seeing some mild differences between the outcome of large_v3 depending on which CPU whisper is run on (in openvino plugin), AMD and Intel CPUs seem to produces slightly different outcomes even when both are run on WIndows. Differences could be word capitalization, interpretation of things like names and acronyms etc. These are for English.

The3IC avatar May 21 '24 19:05 The3IC

I just tried the latest version and it is the same on both sides, but the result is wrong and some words are missing. image

WYC-2020 avatar May 22 '24 04:05 WYC-2020

中华汉语博大精深。😂

Chinese is a profound language.

sdlyyxy avatar Sep 03 '24 14:09 sdlyyxy