Mukul Tripathi

Results 16 comments of Mukul Tripathi

If I have a Sapphire Rapids processor which is AMX-enabled, how do I ensure that AMX is enabled in llama.cpp? Currently I am building it with ```bash cmake...
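Before touching the build flags, it may be worth confirming the kernel actually exposes AMX to userspace. A minimal check (Linux-only; `amx_tile`, `amx_int8`, and `amx_bf16` are the standard Sapphire Rapids feature-flag names):

```shell
# Check whether this CPU advertises the AMX feature flags in /proc/cpuinfo.
# If none appear, no build option will enable AMX kernels.
if grep -qw amx_tile /proc/cpuinfo; then
  echo "AMX detected:"
  grep -o 'amx[a-z0-9_]*' /proc/cpuinfo | sort -u
else
  echo "AMX not detected on this CPU"
fi
```

If the flags are present but a build still ignores them, the llama.cpp build log is the next place to look for which CPU features the compile picked up.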

> Please DO NOT ADD `--cache_lens`

If I do not specify `--cache_lens`, then I am restricted to a 16k context length. How do I specify a 256k context length?
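For illustration only, a hedged sketch of what is being asked: sizing the ktransformers KV cache for a 256k context via `--cache_lens` (the flag name comes from the quoted advice above; the entry-point script, model path, and whether the server accepts a value this large are assumptions, not confirmed from ktransformers docs):

```shell
# Hypothetical ktransformers launch with an enlarged KV cache.
# 262144 tokens = 256k context; model path is a placeholder.
python ktransformers/server/main.py \
  --model_path deepseek-ai/DeepSeek-R1-0528 \
  --cache_lens 262144
```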

Here is the step-by-step tutorial to run it: https://www.youtube.com/watch?v=Xui3_bA26LE and here is the written guide: https://github.com/Teachings/AIServerSetup/blob/main/06-DeepSeek-R1-0528/01-DeepSeek-R1-0528-KTransformers-Setup-Guide.md Note: I have been unable to run it on 0.3.0 or 0.3.1...

Can you share your CUDA version, nvcc output, and the exact commands you ran to build it? I can try to reproduce the issue and find a fix.
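To gather the requested details in one go, something like this works (a small sketch; it reports each tool's first version line and tolerates missing tools rather than failing):

```shell
# Collect build-environment details: CUDA compiler, CMake, and host compiler.
for tool in nvcc cmake gcc; do
  if command -v "$tool" >/dev/null 2>&1; then
    echo "$tool: $("$tool" --version | head -n1)"
  else
    echo "$tool: not found"
  fi
done
```

`nvidia-smi` additionally reports the driver version, which can differ from the toolkit version `nvcc` prints.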

What command did you use to start the server for Qwen3?
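For comparison, a typical llama.cpp server invocation looks something like this (model filename, context size, and port are placeholders; the asker's actual setup may use ktransformers instead, as elsewhere in this thread):

```shell
# Illustrative llama.cpp server start; -m = model, -c = context length.
./llama-server \
  -m /models/Qwen3-235B-A22B-Q4_K_M.gguf \
  -c 32768 \
  --port 8080
```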

> > Here is the step by step tutorial to run it: https://www.youtube.com/watch?v=Xui3_bA26LE and here is the written guide: https://github.com/Teachings/AIServerSetup/blob/main/06-DeepSeek-R1-0528/01-DeepSeek-R1-0528-KTransformers-Setup-Guide.md
> >
> > Note: I have been unable to run it...

I do not think this issue is resolved. I still have no way to set this on mobile. It would be nice to see the settings on the phone. I cannot set...

This issue is resolved now. Run `git pull` followed by `docker compose up --build` to update. ![Screenshot_20241112_143715_Samsung Internet](https://github.com/user-attachments/assets/dab2ba42-4a38-4d96-9314-aeca979dd2a6)

Specifically for the new R1-0528 (but results are similar for V3-0324): I have an AMX-supported PC, and I can confirm that performance with ktransformers is noticeably better than ik_llama...

@ubergarm and @ikawrakow Below is for the Qwen3 235-billion-parameter model. Thank you for the pointers! For the Qwen models, I added `-b 2048 -ub 2048` and that resulted in the...
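For readers wondering where those flags go: in llama.cpp-family servers (including ik_llama.cpp), `-b` sets the logical batch size and `-ub` the physical micro-batch size, both of which mainly affect prompt-processing throughput. A sketch of the invocation (model path and context size are placeholders, not the commenter's actual command):

```shell
# Illustrative invocation with enlarged batch sizes, as in the comment above.
./llama-server \
  -m /models/Qwen3-235B-A22B-Q4_K_M.gguf \
  -b 2048 -ub 2048 \
  -c 32768
```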