llama-cpp-python
Python bindings for llama.cpp
``` jinja2.exceptions.TemplateSyntaxError: Encountered unknown tag 'generation'. Jinja was looking for the following tags: 'elif' or 'else' or 'endif'. The innermost block that needs to be closed is 'if'. ``` llama-cpp-python...
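This error typically appears when a model's chat template uses the `{% generation %}` tag (emitted by some Hugging Face templates for assistant-token masking), which plain Jinja2 does not know. A minimal reproduction sketch, assuming only that `jinja2` is installed:

```python
from jinja2 import Environment, TemplateSyntaxError

# A template fragment using the non-standard 'generation' tag inside an 'if'
# block, mimicking the chat templates that trigger the reported error.
template_src = (
    "{% if add_generation_prompt %}"
    "{% generation %}{{ content }}{% endgeneration %}"
    "{% endif %}"
)

try:
    Environment().from_string(template_src)
except TemplateSyntaxError as exc:
    # Jinja2 rejects the unknown tag while still expecting 'endif'.
    print(exc.message)
```

Templates like this parse only with the extended Jinja dialect used by `transformers`, not with stock Jinja2, which is why the traceback surfaces from inside the bindings.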
**Describe the solution you'd like** Support for PIL library image input (path) instead of Base64 encoding. For example, when using models with transformers library, I provide images this way `img...
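Until path/PIL input is supported natively, a common workaround is to encode the image file into a base64 data URI yourself before passing it to the chat handler. A stdlib-only sketch (the `image_to_data_uri` helper name is mine, not part of the library):

```python
import base64
from pathlib import Path

def image_to_data_uri(path: str) -> str:
    """Read an image file from disk and return it as a base64 data URI.

    The MIME subtype is guessed naively from the file extension; adjust
    as needed (e.g. 'jpg' vs 'jpeg') for your files.
    """
    suffix = Path(path).suffix.lstrip(".").lower() or "png"
    encoded = base64.b64encode(Path(path).read_bytes()).decode("ascii")
    return f"data:image/{suffix};base64,{encoded}"
```

The resulting string can then be used wherever the bindings currently expect a base64-encoded image URL.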
**Is your feature request related to a problem? Please describe.** I make lots of use of these bindings, but frequently find that new models depend on changes in the upstream...
I am running llama-cpp-python Version: 0.3.16 Trying to load the recently released model [embeddinggemma-300M](https://huggingface.co/unsloth/embeddinggemma-300m-GGUF) I get the following error message: `llama_model_load: error loading model: error loading model architecture: unknown model...
Since this repo hasn’t been maintained in over 6 months and I couldn’t get in touch with the original author (@abetlen) via issues or socials, I’ve started a maintained fork:...
**Is your feature request related to a problem? Please describe.** Currently your documentation lists `minicpm-v-2.6` with `MiniCPMv26ChatHandler`. Since `MiniCPM-V 4.5` is out - could you please support it? **Describe the...
Hi, while reading the README, I found a few typos that I fixed. - `API to for basic` -> `API for basic` - `The model will will format` -> `The...
Equivalent to the `-ot` llama.cpp argument: ``` {"--override-tensor", "-ot"}, "<tensor name pattern>=<buffer type>,...", ``` Can be passed as an optional string to the `Llama` class using the new `override_tensor` parameter. Same format as...
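The value is a comma-separated list of `pattern=buffer-type` pairs, matching llama.cpp's `-ot` flag. A small parser sketch illustrating the expected shape of the string (`parse_override_tensor` is a hypothetical helper, not part of the bindings):

```python
def parse_override_tensor(spec: str) -> list[tuple[str, str]]:
    """Split an override-tensor spec such as
    'blk\\.\\d+\\.ffn_.*=CPU,output=CUDA0' into
    (tensor-name-pattern, buffer-type) pairs."""
    pairs = []
    for item in spec.split(","):
        if not item:
            continue  # tolerate trailing commas
        pattern, _, buffer_type = item.partition("=")
        pairs.append((pattern, buffer_type))
    return pairs

# Example: keep feed-forward tensors on CPU, output tensor on the first GPU.
print(parse_override_tensor(r"blk\.\d+\.ffn_.*=CPU,output=CUDA0"))
```

The pattern half is treated as a regular expression against tensor names by llama.cpp itself; the sketch only shows how the string decomposes.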
Updated to support llama.cpp tags/b6490; fixes kv_cache errors; lays groundwork for a more future-proof, raw passthrough class that will interface more closely with llama.cpp
I'm trying to test llama-cpp-python (CPU mode) on a Snapdragon X Plus processor with Python 3.12. `pip install llama-cpp-python --prefer-binary --no-cache-dir --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cpu` and it is successfully installed, but when I...