gemma3 and qwen3

Open ybeetle8 opened this issue 7 months ago • 14 comments

I hope to add and adapt two useful models, gemma3 and qwen3, to the ollama function.

ybeetle8 avatar Apr 29 '25 10:04 ybeetle8

qwen3 works with the latest nightly build of ollama (ollama-ipex-llm-2.3.0b20250428-win.zip) on my Ultra 258V: https://github.com/ipex-llm/ipex-llm/releases/tag/v2.3.0-nightly

nebulakid avatar Apr 29 '25 15:04 nebulakid

Hi @ybeetle8, the qwen3 model is supported in our latest version. You can install it via pip install --pre --upgrade ipex-llm[cpp]; see https://github.com/intel/ipex-llm/blob/main/docs/mddocs/Quickstart/ollama_quickstart.md for more usage details.

sgwhat avatar Apr 30 '25 01:04 sgwhat

@nebulakid It does not work on Ubuntu

vsundupey avatar Apr 30 '25 05:04 vsundupey

Where is "ollama-ipex-llm-2.3.0b20250428-ubuntu.tgz"?

yizhangliu avatar Apr 30 '25 06:04 yizhangliu

Where is "ollama-ipex-llm-2.3.0b20250428-ubuntu.tgz"?

See https://github.com/intel/ipex-llm/blob/main/docs/mddocs/Quickstart/ollama_portable_zip_quickstart.md

jason-dai avatar Apr 30 '25 08:04 jason-dai

qwen3 works with the latest nightly build of ollama (ollama-ipex-llm-2.3.0b20250428-win.zip) on my Ultra 258V: https://github.com/ipex-llm/ipex-llm/releases/tag/v2.3.0-nightly

Is Qwen3 really supported?

fangzm0805 avatar May 05 '25 02:05 fangzm0805

I hope to add and adapt two useful models, gemma3 and qwen3, to the ollama function.

Me too.

fangzm0805 avatar May 05 '25 02:05 fangzm0805

qwen3 works with the latest nightly build of ollama (ollama-ipex-llm-2.3.0b20250428-win.zip) on my Ultra 258V: https://github.com/ipex-llm/ipex-llm/releases/tag/v2.3.0-nightly

No, it reports the error "Error: llama runner process has terminated: exit status 1"

fangzm0805 avatar May 05 '25 04:05 fangzm0805

Where is "ollama-ipex-llm-2.3.0b20250428-ubuntu.tgz"?

Uploaded.

qiuxin2012 avatar May 06 '25 02:05 qiuxin2012

I tried Qwen3 and it answers, but only once. If I ask a second question, nothing happens: no output appears and I have to restart.

wouldntyouknow avatar May 09 '25 11:05 wouldntyouknow
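
For anyone trying to reproduce the "answers once, then hangs" behaviour described above, here is a minimal sketch that sends two prompts in a row to a running Ollama server over its HTTP API. It assumes the server listens on Ollama's default address (http://localhost:11434) and that a model tagged `qwen3` has already been pulled; both are assumptions, not details from this thread. The helper names `build_request`, `ask`, and `reproduce` are hypothetical.

```python
import json
import urllib.request

# Assumption: Ollama's default listen address. Change it if your server
# is bound elsewhere (for example via the OLLAMA_HOST environment variable).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(model: str, prompt: str) -> urllib.request.Request:
    """Build a non-streaming POST request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(model: str, prompt: str, timeout: float = 120.0) -> str:
    """Send one prompt and return the generated text, or raise on timeout."""
    with urllib.request.urlopen(build_request(model, prompt), timeout=timeout) as resp:
        return json.loads(resp.read())["response"]


def reproduce(model: str = "qwen3") -> None:
    """Send two prompts back to back and report which one, if any, fails."""
    prompts = ["Say hello in one sentence.", "Now say goodbye in one sentence."]
    for i, prompt in enumerate(prompts, start=1):
        try:
            print(f"prompt {i}: {ask(model, prompt)[:80]}")
        except OSError as exc:  # covers connection errors and timeouts
            print(f"prompt {i} failed: {exc!r}")
            break
```

Calling `reproduce()` with the server running prints the start of each answer. If the second request times out while the first succeeds, that matches the symptom reported here and gives a concrete log line to attach to a bug report.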

After updating the Intel drivers, I got Qwen3 (32b) working on v2.3.0-nightly for Ubuntu.

vsundupey avatar May 10 '25 11:05 vsundupey

On the latest nightly build, on an Arc 750, all models run slower than on the CPU, even though the GPU load is at 100%. Is it only like this on my machine? This is the Windows build, with the latest driver.

AlexXT avatar May 17 '25 09:05 AlexXT

Hi all, gemma3 is supported in ollama starting from ipex-llm[cpp]==2.3.0b20250529. You can try it again with pip install ipex-llm[cpp]==2.3.0b20250529.

rnwang04 avatar May 30 '25 05:05 rnwang04
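
To check whether an installed build is at least the one named above, the date suffix in the nightly version string can be compared directly. The following is a rough sketch, assuming nightly versions always end in `b` followed by an eight-digit date (as in 2.3.0b20250529) and that the installed distribution is named `ipex-llm`; the function names are hypothetical, and versions without that suffix are reported as unknown rather than guessed.

```python
import re
from importlib.metadata import PackageNotFoundError, version
from typing import Optional

# First build stated in this thread to support gemma3 in ollama.
GEMMA3_MIN_BUILD = 20250529


def build_date(ver: str) -> Optional[int]:
    """Return the YYYYMMDD nightly suffix as an int, or None if absent."""
    match = re.search(r"b(\d{8})$", ver)
    return int(match.group(1)) if match else None


def supports_gemma3(ver: str) -> Optional[bool]:
    """True/False for nightly builds; None when the version has no date suffix."""
    date = build_date(ver)
    if date is None:
        return None
    return date >= GEMMA3_MIN_BUILD


def installed_version(dist: str = "ipex-llm") -> Optional[str]:
    """Return the installed version string, or None if the package is missing."""
    try:
        return version(dist)
    except PackageNotFoundError:
        return None
```

For example, `supports_gemma3("2.3.0b20250428")` is `False`, while the 2.3.0b20250612 build confirmed later in this thread evaluates to `True`.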

Confirmed that both qwen3 and gemma3 models work with https://github.com/ipex-llm/ipex-llm/releases/download/v2.3.0-nightly/ollama-ipex-llm-2.3.0b20250612-ubuntu.tgz.

charlescng avatar Jun 14 '25 14:06 charlescng