
LLM inference in C/C++

Results: 1628 llama.cpp issues, sorted by recently updated

I encountered the same issue (#10556) on the Ascend 310B1 as well.
```
root@orangepiaipro-20t:/data/llama.cpp# cmake -B build -DGGML_CANN=on -DCMAKE_BUILD_TYPE=release
-- Warning: ccache not found - consider installing it for faster compilation or...
```
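For completeness, the quoted configure command is normally followed by a compile step; a minimal sketch, in which the `--build` invocation and `-j` flag are assumed rather than taken from the report:

```
cmake -B build -DGGML_CANN=on -DCMAKE_BUILD_TYPE=release
# the build step below is assumed, not part of the quoted output
cmake --build build -j
```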

Adds example docs for converting a Granite vision model, which is essentially a LLaVA-NeXT model with multiple feature layers, using SigLIP for the visual encoder, and a Granite language...

examples

### Name and Version
```
llama-cli --version
ggml_vulkan: Found 1 Vulkan devices:
ggml_vulkan: 0 = AMD Radeon Graphics (RADV VEGA20) (radv) | uma: 0 | fp16: 1 | warp size: 64...
```

bug-unconfirmed

- [x] I have read the [contributing guidelines](https://github.com/ggerganov/llama.cpp/blob/master/CONTRIBUTING.md)
- Self-reported review complexity:
  - [ ] Low
  - [x] Medium
  - [ ] High

examples
python

This pull request aims to integrate SIMD instructions via `vecintrin.h` into llama.cpp on the s390x platform. Currently the SIMD code paths are included in the following `ggml_vec_dot` functions:...

ggml
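For context, a minimal sketch of the general technique the entry above describes: a single-precision dot product using the z/Architecture vector intrinsics from `vecintrin.h`. This is not the PR's code; the function name, the `-march=z14 -mzvector` build flags, and the assumption that `n` is a multiple of 4 are all illustrative.

```
// A hedged sketch, not the PR's actual code: dot product of two float arrays
// using the s390x vector facility (build with e.g. -march=z14 -mzvector).
#include <vecintrin.h>

static float dot_f32_zvector(const int n, const float * x, const float * y) {
    __vector float acc = vec_splats(0.0f);           // 4 single-precision lanes
    for (int i = 0; i < n; i += 4) {                 // assumes n % 4 == 0
        const __vector float vx = vec_xl(0, x + i);  // unaligned loads
        const __vector float vy = vec_xl(0, y + i);
        acc = vec_madd(vx, vy, acc);                 // per-lane fused multiply-add
    }
    return acc[0] + acc[1] + acc[2] + acc[3];        // horizontal sum of the lanes
}
```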

Fixes https://github.com/ggml-org/llama.cpp/issues/11946. I added an option `GGML_CUDA_NO_FA` that is used for CUDA, HIP, and MUSA. Two more general questions about compile options:
* Do we have guidelines regarding whether...

Nvidia GPU
ggml
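For context on the compile option named in the entry above, a sketch of how such a toggle would be passed at configure time. The pairing with `GGML_CUDA=ON` and the build command are assumptions, not quoted from the PR; `NO_FA` presumably disables the FlashAttention kernels.

```
# hypothetical usage of the GGML_CUDA_NO_FA option described above
cmake -B build -DGGML_CUDA=ON -DGGML_CUDA_NO_FA=ON
cmake --build build --config Release
```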

### Name and Version
```
$ ./llama-cli --version
version: 3680 (947538ac)
built with cc (Debian 14.2.0-16) 14.2.0 for x86_64-linux-gnu
```
### Operating systems
Linux
### GGML backends
CPU
### Hardware
Intel Celeron 1007U...

bug-unconfirmed

### Name and Version
```
version: 4754 (de8b5a36)
built with Apple clang version 16.0.0 (clang-1600.0.26.6) for arm64-apple-darwin24.2.0
```
but also reproducing on the current main branch
### Operating systems
Mac...

bug-unconfirmed