
Determining if AMX is in use by ollama

Open js333031 opened this issue 1 year ago • 6 comments

Hello, I used the latest steps to install ipex-llm into a venv on a 5th Gen Xeon system. Based on the screenshot below, I don't think AMX is being utilized. Should AMX show up in the list of CPU features on the last line of the output below? lscpu shows that the AMX instructions are present. Can you please show a verification step confirming that the CPU optimizations are installed in the ipex-llm venv?

image

Thanks

js333031 avatar Jul 03 '24 01:07 js333031
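
The hardware side of the question above (whether the kernel exposes AMX at all, which is what lscpu reports) can be checked from Python. The following is a minimal sketch, assuming a Linux host where `/proc/cpuinfo` has a `flags` line and AMX appears as `amx_tile`, `amx_bf16`, and `amx_int8`. The function name `amx_flags_from_cpuinfo` is hypothetical. It only shows that the CPU and kernel advertise AMX; it does not show that ollama or ipex-llm actually uses it.

```python
from pathlib import Path

# AMX feature flags as they appear in /proc/cpuinfo on Linux.
AMX_FLAGS = ("amx_tile", "amx_bf16", "amx_int8")


def amx_flags_from_cpuinfo(text: str) -> set:
    """Return the AMX flags found in the first 'flags' line of cpuinfo text."""
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            present = set(value.split())
            return {flag for flag in AMX_FLAGS if flag in present}
    # No 'flags' line at all (for example, a non-x86 or non-Linux system).
    return set()


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")
    if cpuinfo.exists():
        found = amx_flags_from_cpuinfo(cpuinfo.read_text())
        missing = set(AMX_FLAGS) - found
        print("AMX flags present:", sorted(found))
        print("AMX flags missing:", sorted(missing))
    else:
        print("/proc/cpuinfo not found; this check assumes Linux.")
```

An empty "present" list would mean the kernel does not expose AMX, in which case no software on the machine could use it.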

Hi @js333031, the ipex-llm ollama build is a GPU-optimized version, so we currently do not plan to work on this.

sgwhat avatar Jul 04 '24 01:07 sgwhat

Has there been any change to the plan for optimizing for Xeon/AMX?

js333031 avatar Oct 03 '24 21:10 js333031

Sorry, we don't have any plan for optimizing ollama for Xeon/AMX yet.

glorysdj avatar Oct 08 '24 02:10 glorysdj

I am also asking about this. I have noticed that it's supported by the llama_cpp package, so I think it just depends on how far downstream they are from changes made to the base package.

endomorphosis avatar Nov 10 '24 20:11 endomorphosis

Any update on this?

tjwebb avatar Jun 28 '25 17:06 tjwebb

I have not tried it yet, but this is one way to trace it: https://www.intel.com/content/www/us/en/docs/onednn/developer-guide-reference/2025-0/verbose-mode.html

For PyTorch apps using oneDNN: enable the environment variable as shown in https://docs.pytorch.org/tutorials/recipes/amx.html

js333031 avatar Jul 13 '25 15:07 js333031
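
The tracing approach suggested above can be sketched as a small log filter. This is a rough sketch, assuming the workload goes through oneDNN and was run with `ONEDNN_VERBOSE=1`, so that each executed primitive prints a comma-separated line starting with `onednn_verbose` (or `dnnl_verbose` in older releases) that contains an `exec` field and an implementation name. Treating any field containing `amx` as evidence of an AMX kernel is a heuristic assumed here, not a documented oneDNN guarantee, and the function name `amx_primitive_lines` is hypothetical. Nothing in this thread establishes that the ollama build uses oneDNN, so an empty result there would not be conclusive.

```python
import sys

# Line prefixes used by oneDNN verbose output (older releases used dnnl_verbose).
VERBOSE_PREFIXES = ("onednn_verbose", "dnnl_verbose")


def amx_primitive_lines(log_text: str) -> list:
    """Return executed-primitive lines whose fields mention an AMX kernel."""
    hits = []
    for line in log_text.splitlines():
        if not line.startswith(VERBOSE_PREFIXES):
            continue
        fields = line.split(",")
        # Only primitive execution lines count; 'info' lines merely describe
        # what the CPU supports, not which kernels actually ran.
        if "exec" not in fields:
            continue
        # Heuristic (an assumption): AMX kernels carry 'amx' in their
        # implementation name, e.g. a field ending in 'avx512_core_amx_bf16'.
        if any("amx" in field.lower() for field in fields):
            hits.append(line)
    return hits


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python this_script.py captured_log.txt
    with open(sys.argv[1], encoding="utf-8", errors="replace") as handle:
        matches = amx_primitive_lines(handle.read())
    print(f"{len(matches)} executed primitive line(s) mention AMX")
    for match in matches[:10]:
        print(match)
```

To use it, capture the application's standard output to a file while `ONEDNN_VERBOSE=1` is set, then pass that file as the first argument. If no line matches, either the workload did not dispatch to AMX kernels or it does not use oneDNN at all.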