llama-cpp-python
llama-cpp-python copied to clipboard
Workflow Example for CUDA 12.6 Windows
For those trying to use GitHub Actions to build the latest version (v0.3.7) with CUDA 12.6 for Windows but failed, maybe installing VS version >=17.12 and CUDA directly like here can solve the issue, here is an example workflow.
P.S. change the value of CMAKE argument "DCMAKE_CUDA_ARCHITECTURES" to your local CUDA arch instead of "all" in the original workflow can reduce a lot of compilation time. https://cmake.org/cmake/help/latest/prop_tgt/CUDA_ARCHITECTURES.html https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/#gpu-feature-list