llama-cpp-python icon indicating copy to clipboard operation
llama-cpp-python copied to clipboard

Python bindings for llama.cpp

Results 424 llama-cpp-python issues
Sort by recently updated
recently updated
newest added

**Is your feature request related to a problem? Please describe.** In RAG-scenarious, I think it would be a great help to differentiate if a LLM is hallucinating or retrieving its...

enhancement
question

Hey, I've been struggling for a month to install the latest version with CUDA. It was a nightmare. So here is the guide how to do that. tldr docker syntax:...

After I install `llama-cpp-python-server` with cuda support and run `python3 -m llama_cpp.server --model starcoderbase-3b/starcoderbase-3b.Q4_K_M.gguf --n_gpu_layers 10 ` The GPU is not getting used its running on the CPU

# Prerequisites Please answer the following questions for yourself before submitting an issue. - [x] I am running the latest code. Development is very rapid so there are no tagged...