Georgi Gerganov

Results 420 comments of Georgi Gerganov

@abetlen Looking great! My impression is that constrained sampling is under-utilized today and there are many cool applications of this approach that are yet to be demonstrated.

Relevant: - https://twitter.com/GrantSlatton/status/1657559506069463040 - https://twitter.com/GrantSlatton/status/1637692033115762688

https://github.com/ggerganov/llama.cpp/pull/1887

Yes, this is expected. There should be no whitespace at the end. See this comment for more information: https://github.com/ggerganov/llama.cpp/pull/1297#issuecomment-1533314364

> The idea that I'm having in my mind is maybe hard code some template patterns to detect if it's which type of template. In reality, we will mostly have...

I think this looks like a similar task as the one I did for the GPT-J model: https://github.com/ggerganov/ggml/tree/master/examples/gpt-j See the convert script there. If somebody wants to take a shot...

I added a conversion script: https://github.com/ggerganov/whisper.cpp/blob/master/models/convert-h5-to-ggml.py Use like this: ``` git clone https://github.com/openai/whisper git clone https://github.com/ggerganov/whisper.cpp git clone https://huggingface.co/openai/whisper-medium python3 ./whisper.cpp/models/convert-h5-to-ggml.py ./whisper-medium/ ./whisper . ``` However, just as you noticed,...

So looking a bit more into this, I think that `proj_out.weight` is not actually used: https://github.com/huggingface/transformers/blob/9a5b84a0076a04fe9596da72e8668069d4f09ea0/src/transformers/models/whisper/modeling_whisper.py#L1099-L1106 Looking at this [discussion](https://github.com/openai/whisper/discussions/64#discussioncomment-4218667), it seems like people are still struggling to make the...

Great! I confirm that it works now. Tried both models: - https://huggingface.co/openai/whisper-medium - https://huggingface.co/openai/whisper-base.en > Unsure about which is decode and encode here too (its the same value) First is...

@luigisaetta Hi, thanks for the help! We figured it out yesterday together with @jzju and we can now use HF models with `whisper.cpp`. @abelbabel Yes, that model is the same....