llama-cpp-python icon indicating copy to clipboard operation
llama-cpp-python copied to clipboard

Python bindings for llama.cpp

Results 424 llama-cpp-python issues
Sort by recently updated
recently updated
newest added

# Prerequisites Please answer the following questions for yourself before submitting an issue. - [ X] I am running the latest code. Development is very rapid so there are no...

bug
performance

Create a function that takes in text as input, converts it into tokens, counts the tokens, and then returns the text with a maximum length that is limited by the...

enhancement
question

How can I train llama-3-8B-Instruct with my own dataset in csv format? Do you have a code or idea?

**Is your feature request related to a problem? Please describe.** PyTorch is able to install its CUDA dependencies via the above wheels during pip-install. Adding these wheels to the dependencies...

- Enables code interpreter/generation feature of functionary models by providing `{"type": "code_interpreter}` in one of the tools. - Adjust prompt template when code_interpreter tool is provided in the request -...

@abetlen Hello, when I use python -m llama_cpp.server deployed a llava13b service on the Kubernetes platform, I noticed an issue where only the first image could be correctly returned. When...

**Is your feature request related to a problem? Please describe.** Inquiring whether this project supports loading a "sharded" gguf model file ? The llama cpp project appears to add tooling...

Added code allows to specify multiple files to load via HuggingFace Hub in LLama.from_pretrained(). New argument takes a List of strings, which are used the same as the 'file_name' string...