Eric Curtin
We will enable pushing and pulling models to OCI registries in RamaLama once the new "podman artifact" command is complete: https://github.com/containers/ramalama
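As a rough sketch of what that workflow could look like once "podman artifact" lands (the exact subcommand syntax, the oci:// transport form, and the quay.io/example registry path are assumptions, not final behaviour):

```
# Pull a model from an OCI registry (hypothetical registry path)
ramalama pull oci://quay.io/example/gemma3:latest

# Push a local model back to an OCI registry
ramalama push gemma3 oci://quay.io/example/gemma3:latest
```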
> Tried again as I see gfx1100 listed in the vLLM build docs. It wasn't smooth, but I got it running at least locally on Fedora 41 with https://repo.radeon.com/rocm/el9/6.3.4/ packages. No time...
Users have hit this in RamaLama also:

```
Attempted to download Gemma3 from Ollama registry with ramalama run gemma3
Name pulled from https://www.ollama.com/library/gemma3
Got an error when running ramalama run...
```
Just tagging @ochafik and @jan-wassenberg for awareness
Likely related: https://github.com/ggml-org/llama.cpp/issues/12857
Is this Ollama-specific? The above issue doesn't seem to be an Ollama model.
I do think we should try to fix this one way or another; gemma3 is a very popular model:

```
$ ramalama run gemma3
Loading model
llama_model_load: error loading model: error...
```
I'm pointing people to these instead for now: https://github.com/containers/ramalama/pull/1288/files
I think that GPU is gfx1103. Can you check if the relevant file is in the container in /opt? (It should have gfx1103 in the filename)
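Something like this should show whether any gfx1103-specific files are present (the image name is just a placeholder for whichever RamaLama ROCm image you're running):

```
# List any gfx1103-specific library files under /opt inside the container
podman run --rm <your-ramalama-rocm-image> \
  sh -c "find /opt -iname '*gfx1103*' 2>/dev/null"
```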
You could simply be running out of VRAM. How much VRAM does your GPU have?
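On AMD you can check with rocm-smi, for example (output format varies a bit by ROCm version):

```
# Report total and used VRAM for the GPU
rocm-smi --showmeminfo vram
```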
If llama3.2:1b works, you are likely running out of VRAM; I think the default is the 3b variant.
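i.e. something like:

```
# The 1b variant needs far less VRAM than the default 3b
ramalama run llama3.2:1b
```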