Georgi Gerganov

420 comments of Georgi Gerganov

Ok, you can track the state of `ggml` becoming thread-safe in this issue: https://github.com/ggerganov/llama.cpp/issues/3960 Sorry for the inconvenience - you might want to stick with whisper.cpp v1.5.1 for now

Does this change have any positive effect for `whisper-v3`, or is it still repeating stuff?

Did you run some tests?

Wow! This looks like very important work. Would love to give this a try at some point. Any reason to prefer `-ac 500` over `-ac 512`? Round numbers are...

Use `-bs 1` to get the old speed. Quality should generally be better with more beams, but it's possible that you won't observe much of a difference
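A minimal sketch of the trade-off described above, assuming the stock `main` example binary and bundled sample paths (adjust model and audio paths to your setup):

```shell
# greedy decoding (beam size 1) -- fastest, matches the old behavior
./main -m models/ggml-base.en.bin -f samples/jfk.wav -bs 1

# beam search with 5 beams -- slower, generally better transcription quality
./main -m models/ggml-base.en.bin -f samples/jfk.wav -bs 5
```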

Do not use large-v3. I haven't checked the status recently, but last time I did, this model did not work well even with the original OpenAI repo. Try...

Hm, I can't reproduce on my Mac Studio:

```
./bin/main -m ../models/ggml-large-v2.bin -f ../samples/gb0.wav --no-gpu
whisper_init_from_file_with_params_no_state: loading model from '../models/ggml-large-v2.bin'
whisper_model_load: loading model
whisper_model_load: n_vocab      = 51865
whisper_model_load: n_audio_ctx =...
```

Ok, it seems better to me to aim for the more general solution and not merge this change for now.

Not sure why it fails - I have a very basic understanding of the CoreML stuff. Probably somebody with more expertise can help out

You can use the `whisper_pcm_to_mel()` + `whisper_lang_auto_detect()` API. You will get the probs for all languages in the `lang_probs` array: https://github.com/ggerganov/whisper.cpp/blob/59a3d0cb576db605f76f82f07350647837e15c7a/whisper.h#L244-L255
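A rough sketch of that API sequence, assuming a loaded model and 16 kHz mono float PCM in `samples` (the init call shown matches recent whisper.cpp versions; older ones use `whisper_init_from_file`). This requires linking against whisper.cpp, so treat it as illustrative:

```c
#include <stdio.h>
#include "whisper.h"

// Print the auto-detected language probabilities for a PCM buffer.
// `samples` is 16 kHz mono float audio of length `n_samples`.
void print_lang_probs(const char * model_path, const float * samples, int n_samples) {
    struct whisper_context * ctx =
        whisper_init_from_file_with_params(model_path, whisper_context_default_params());
    if (ctx == NULL) {
        return;
    }

    // compute the mel spectrogram from the raw PCM
    if (whisper_pcm_to_mel(ctx, samples, n_samples, /*n_threads=*/4) != 0) {
        whisper_free(ctx);
        return;
    }

    // probabilities for every supported language are written into lang_probs;
    // the array must hold whisper_lang_max_id() + 1 entries (100 is enough here)
    float lang_probs[100];
    const int best = whisper_lang_auto_detect(ctx, /*offset_ms=*/0, /*n_threads=*/4, lang_probs);
    if (best >= 0) {
        for (int i = 0; i <= whisper_lang_max_id(); ++i) {
            printf("%s: %.4f\n", whisper_lang_str(i), lang_probs[i]);
        }
        printf("detected: %s\n", whisper_lang_str(best));
    }

    whisper_free(ctx);
}
```

The return value of `whisper_lang_auto_detect()` is the id of the most probable language, so you can either take it directly or inspect the full `lang_probs` array as mentioned above.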