Purfview
Purfview
From the benchmarks posted in this thread you can see that VAD runs 134 audio seconds/s, and that's on the ancient CPU. You can use `window_size_samples=1536` to make VAD faster....
> we want to measure the performance in percentage, therefore `t_j / t_i` is calculated. Now it shows something like a car's speed in percentage relative to a speed of...
> Did you benchmark the performance w.r.t CPU threads? I didn't noticed any impact when adjusting options related to threads.
1) Whisper's model can just miss something in transcription for no apparent reason 2) A one byte change in audio can trigger a different result 3) A one token change...
This is a bit wrong repo to post this. There it's more suitable place: https://github.com/Purfview/whisper-standalone-win/discussions Btw, Standalone Faster-Whisper doesn't need this repo installed, as you wrote in your requirements.
I think you should post this at ctranslate2 repo.
>This started happening with the new version, not a problem with the prior version. processing multiple files using one click.bat. When I resubmit the erroed file in the oneclick it's...
>Itried your suggestion with both false an true options and didn't make a difference, Didn't it fail on the first file when "--model_preload true"?
>I am suspisious that the the vram is not released when the 1st file completed. That error is about RAM. >This started happening with the new version, not a problem...
>I don't remember now if it should be released (from whisper model) and I don't remember how this option works: --model_preload By default, if "--ff_vocal_extract" is in use then "--model_preload"...