SONG Ge
Hi @lumurillo, we have reproduced this issue and will keep you informed as we make progress.
Hi @Kaszanas, you will not see any Intel GPU info until you load a model.
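One quick way to see this for yourself (a sketch, assuming a Linux shell, the `./ollama` symlink from the quickstart, and that the relevant log lines actually mention the GPU by name; the model tag is arbitrary):

```bash
# Hedged sketch: Intel GPU details appear in the serve log only after a model loads.
./ollama serve > server.log 2>&1 &   # start the server and capture its log
grep -i "gpu" server.log             # likely empty before any model is loaded
./ollama run llama3                  # loading a model triggers device detection
grep -i "gpu" server.log             # GPU info should show up after the load
```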
Hi @Cbaoj, you may use `transformers==4.38.2` to get better performance; we are working on optimizing Llama model performance on 4.38.x.
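In case it helps, pinning the version is a one-liner; the `pip show` step is just an optional sanity check:

```bash
# Hedged workaround: pin transformers to 4.38.2 until the 4.38.x optimization work lands.
pip install transformers==4.38.2
pip show transformers   # optional: confirm "Version: 4.38.2" is what the environment resolved
```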
Hi @publicarray, we do not support Gemma3 yet; our team is still working on it. We recommend switching to other models such as Qwen3 or DeepSeek-R1 in the meantime.
Hi @Fucalors,
1. Could you please provide the complete runtime log from the Ollama server side during model inference?
2. Could you please run `ls-sycl-device.exe` and reply with the...
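For reference, one way to capture both into files you can attach (a sketch; the file names are placeholders, and this assumes a shell where redirection works as shown):

```bash
# Hedged sketch for collecting the requested diagnostics; file names are arbitrary.
./ollama serve > ollama-server.log 2>&1 &   # capture the complete server-side runtime log
./ollama run <model>                        # reproduce the inference problem with your model
ls-sycl-device.exe > sycl-devices.txt       # dump the SYCL-visible device list
```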
Hi @Fucalors, I don't think you are running the ipex-llm Ollama. Please double-check your environment and installation method; you may refer to our documentation at https://ipex-llm-latest.readthedocs.io/en/latest/doc/LLM/Quickstart/ollama_quickstart.html for installing Ollama.
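Roughly, the install flow looks like this (a sketch based on the quickstart above; the environment name is arbitrary, and on Windows the last step is `init-ollama.bat`):

```bash
# Hedged sketch of installing the ipex-llm Ollama, following the quickstart.
conda create -n llm-cpp python=3.11   # "llm-cpp" is an arbitrary environment name
conda activate llm-cpp
pip install --pre --upgrade ipex-llm[cpp]
init-ollama                           # creates ollama symlinks in the current directory
```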
> +1 current version is 0.9.x and latest version of ollama supports embedding model at 0.12.x

Sorry @junesg, but we are not actively developing ollama at the moment.
Not sure; I have not been working on it recently.
@brownplayer, the Qwen3 model is supported in our latest version. You can install it via `pip install --pre --upgrade ipex-llm[cpp]`; see https://github.com/intel/ipex-llm/blob/main/docs/mddocs/Quickstart/ollama_quickstart.md for more usage details.
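After the upgrade, running it should be along these lines (a sketch assuming the symlinks created by `init-ollama` and that the model tag is simply `qwen3`; check the quickstart for the exact steps):

```bash
# Hedged sketch: serve and run Qwen3 with the ipex-llm Ollama.
init-ollama          # refresh the ollama symlinks after upgrading ipex-llm[cpp]
./ollama serve &     # start the server in the background
./ollama run qwen3   # "qwen3" is assumed to be the model tag; adjust as needed
```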
v0.5.4; we are working on releasing v0.6.2 for Linux first.