llama-cpp-python issues

Metal installation documentation

I tried setting up `llama-cpp-python` in the current version `0.3.7` on my MacBook M4 Pro. In the first step I only installed via `pip install llama-cpp-python --no-cache-dir` without specifiying the...

devtobi

Issue with Installing llama-cpp-python 0.3.7: Dependency Problems with scikit-build-core

**Issue:** I am trying to update llama-cpp-python to the latest version (0.3.7) to test the latest Distill version models of DeepSeek, such as DeepSeek r1 Distil Qwen. However, I am...

lcnmzz00

After choosing to offload all layers onto the GPU, the Ram used for model loading is not released

My graphics card was RTX3060 12G, the model used was Qwen2.5-7B-instruct-Q4_k_M, normally the model should only take up 4~5G VRam, so I thought the VRam of my GPU was sufficient...

MATII13T

chore(deps): update starlette-context requirement from <0.4,>=0.3.6 to >=0.3.6,<0.5

Updates the requirements on [starlette-context](https://github.com/tomwojcik/starlette-context) to permit the latest version. Release notes Sourced from starlette-context's releases. v0.4.0 Drop support for Python 3.8. Now only 3.9+ is supported ( tomwojcik/starlette-context#152 )...

dependabot[bot]

dependencies

python

Prebuild WHL, Cuda 12.1 win10 python 3.10

Hello, I've been trying to build llama-cpp-python latest version (0.3.7)for a week for Deepseek R1. But i failed. Whenever i did succesfully it errors(access violation 0x000000) when import and run....

guney45

openai API `n` argument is ignored

1

# Prerequisites Please answer the following questions for yourself before submitting an issue. - [X] I am running the latest code. Development is very rapid so there are no tagged...

BenjaminMarechalEVITECH

successfully installed !! CUDA/v12.6 !! Visual Studio 2022!!

3

I've just successfully installed it! Here's the information for your reference. PowerShell ： $env:CUDA_TOOLKIT_ROOT_DIR="C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v12.6" $env:CMAKE_GENERATOR_PLATFORM="x64" $env:FORCE_CMAKE="1" $env:CMAKE_ARGS="-DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES=89" pip install llama-cpp-python --no-cache-dir --force-reinstall --upgrade ********************************************************************** **...

dw5189

Workflow Example for CUDA 12.6 Windows

For those trying to use GitHub Actions to build the latest version (v0.3.7) with CUDA 12.6 for Windows but failed, maybe installing VS version >=17.12 and CUDA directly like [here](https://github.com/Jimver/cuda-toolkit?tab=readme-ov-file#example-usage)...

lyxkilo

OK!!! ggml_cuda_init: found 1 CUDA devices:

1

(base) PS C:\WINDOWS\system32> conda activate CUDA124-py312 (CUDA124-py312) PS C:\WINDOWS\system32> $env:CUDA_TOOLKIT_ROOT_DIR="C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v12.4" (CUDA124-py312) PS C:\WINDOWS\system32> $env:CMAKE_GENERATOR_PLATFORM="x64" (CUDA124-py312) PS C:\WINDOWS\system32> $env:FORCE_CMAKE="1" (CUDA124-py312) PS C:\WINDOWS\system32> $env:CMAKE_ARGS="-DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES=89" (CUDA124-py312) PS C:\WINDOWS\system32>...

dw5189

更新llama cpp，目前不支持deepseek r1以及蒸馏模型

**Is your feature request related to a problem? Please describe.** A clear and concise description of what the problem is. Ex. I'm always frustrated when [...] **Describe the solution you'd...

hendrymax

llama-cpp-python
llama-cpp-python copied to clipboard

Metadata

Metal installation documentation

Issue with Installing llama-cpp-python 0.3.7: Dependency Problems with scikit-build-core

After choosing to offload all layers onto the GPU, the Ram used for model loading is not released

chore(deps): update starlette-context requirement from <0.4,>=0.3.6 to >=0.3.6,<0.5

Prebuild WHL, Cuda 12.1 win10 python 3.10

openai API `n` argument is ignored

successfully installed !! CUDA/v12.6 !! Visual Studio 2022!!

Workflow Example for CUDA 12.6 Windows

OK!!! ggml_cuda_init: found 1 CUDA devices:

更新llama cpp，目前不支持deepseek r1以及蒸馏模型

← Metadata

Owner

Metadata

llama-cpp-python llama-cpp-python copied to clipboard

Metadata

← Metadata

Owner

Metadata

llama-cpp-python
llama-cpp-python copied to clipboard