llama-cpp-python Workflow Example for CUDA 12.6 Windows

Open lyxkilo opened this issue 9 months ago • 0 comments

For anyone who tried to build the latest version (v0.3.7) with CUDA 12.6 for Windows using GitHub Actions and failed: installing Visual Studio version >=17.12 and installing CUDA directly, like here, may solve the issue. Here is an example workflow.

P.S. Changing the value of the CMake argument "-DCMAKE_CUDA_ARCHITECTURES" from "all" (as in the original workflow) to your local CUDA architecture can cut compilation time considerably, because the CUDA code is then compiled for one GPU architecture instead of every supported one. See https://cmake.org/cmake/help/latest/prop_tgt/CUDA_ARCHITECTURES.html and https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/#gpu-feature-list
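As a rough illustration of the P.S. above, here is a minimal Python sketch that turns a GPU compute capability into the value CMake expects and builds a CMAKE_ARGS string from it. The helper names `arch_from_compute_capability` and `cmake_args_for` are made up for this example, and the compute capability "8.6" is only an assumed sample value; look up your own GPU in the NVIDIA feature list linked above. It also assumes the build is driven through the CMAKE_ARGS environment variable with `-DGGML_CUDA=on`, which may differ from what the example workflow does.

```python
import os


def arch_from_compute_capability(compute_cap: str) -> str:
    """Convert a compute capability such as "8.6" into CMake's "86" form."""
    major, minor = compute_cap.strip().split(".")
    return f"{int(major)}{int(minor)}"


def cmake_args_for(compute_cap: str) -> str:
    """Build a CMAKE_ARGS value that targets one GPU architecture instead of "all"."""
    arch = arch_from_compute_capability(compute_cap)
    return f"-DGGML_CUDA=on -DCMAKE_CUDA_ARCHITECTURES={arch}"


if __name__ == "__main__":
    # "8.6" is an assumed example value; replace it with your GPU's compute capability.
    env = dict(os.environ, CMAKE_ARGS=cmake_args_for("8.6"))
    print(env["CMAKE_ARGS"])
    # To actually build, `env` could be passed to a pip invocation, for example:
    # subprocess.run([sys.executable, "-m", "pip", "install", "llama-cpp-python"],
    #                env=env, check=True)
```

In a GitHub Actions workflow the same string would simply be set as the CMAKE_ARGS environment variable of the build step; the Python wrapper is only there to make the conversion explicit.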

Mar 01 '25 12:03 lyxkilo
