llama-cpp-python issues

Update README.md

4

Cannot install current version of llama-cpp-python 0.3.16 on Windows (backend independent)

7

# Prerequisites Please answer the following questions for yourself before submitting an issue. - [x] I am running the latest code (`Version 0.3.16`). - [x] I carefully followed the [README.md](https://github.com/abetlen/llama-cpp-python/blob/main/README.md)....

devtobi

Updated ROCm installation instructions

1

The updated installation instructions allow the utilization of the GPU, as per the [upstream instructions](https://github.com/ggerganov/llama.cpp/blob/master/docs/build.md#hip).

agronholm

Access Violation issue facing for exe created using pyinstaller

2

I am trying to create an executable for one of my Python scripts. When I try to run the application created using PyInstaller, I get an error while trying...

maniron214

Can't compute multiple embeddings in a single call

3

# Prerequisites Please answer the following questions for yourself before submitting an issue. - [x] I am running the latest code. Development is very rapid so there are no tagged...

jeberger

Better Qwen2.5-VL chat template.

alcoftTAO

llama_get_kv_self debug symbols removed

# Prerequisites Please answer the following questions for yourself before submitting an issue. - [x] I am running the latest code. Development is very rapid so there are no tagged...

Bread7

ggml_cuda_init: failed to initialize CUDA: (null) on Windows with CUDA 12.9

2

System Information: * OS: Windows * GPU: NVIDIA GeForce RTX 5060 Ti * NVIDIA Driver Version: 577.00 * CUDA Version (from `nvidia-smi`): 12.9 * Python Version: 3.12 * Visual Studio:...

sequeirawilson2021

Thinking toggle support for Qwen related models

**Is your feature request related to a problem? Please describe.** Cannot toggle thinking in Qwen models. When we try to do it through the user prompt, it still gives out opening...

Kishlay-notabot

Building and installing llama_cpp from source for RTX 50 Blackwell GPU

2

--- ### My Journey to Building `llama-cpp-python` with CUDA on an RTX 5060 Ti (Blackwell Architecture) This guide details the steps I took to successfully install `llama-cpp-python` with full CUDA...

Johnnyboycurtis
