llama-cpp-python llama-server not using GPU

llama-server not using GPU

Open RakshitAralimatti opened this issue 1 year ago • 5 comments

After I install llama-cpp-python-server with cuda support and run python3 -m llama_cpp.server --model starcoderbase-3b/starcoderbase-3b.Q4_K_M.gguf --n_gpu_layers 10 The GPU is not getting used its running on the CPU

Nov 09 '24 04:11 RakshitAralimatti

llama-cpp-python llama-cpp-python copied to clipboard

llama-server not using GPU

llama-cpp-python
llama-cpp-python copied to clipboard