Andrei
Andrei
Hi there, first of all thank you for all the amazing work. I'm currently maintaining some python projects that bind to c++ code in a shared library via the python...
With the update to v1 OpenAI's API changed significantly, while backwards compatibility was straightforward to preserve on the server the python API is lagging. The main difference in the pre...
Open to suggestions / assistance on how to make installation easier and less error prone. One thought is to add better platform detection to the cmakelists and provide better docs...
Closes #771 > This is going to be a big PR as it requires refactoring a good deal of the `Llama` class to make it thread safe and support multiple...
Hi @saharNooby, first off amazing work in this repo, I've been looking for a cpu implementation of RWKV to experiment with using the pre-trained models (don't have a large gpu)....
Hi @nicholasyager great work on the repo. I was wondering if you'd consider opening a PR for the official Microsoft guidance project to add support for llama-cpp-python. Cheers.
Very much still a work in progress however I've been able to convert the weights and update the clip model to support the differences in the PaliGemma implementation of SigLIP...
# Steps ``` uv pip install llama-cpp-python ``` # Output ``` Failed to download and build `llama-cpp-python==0.2.87` Caused by: Failed to extract archive Caused by: failed to unpack `/home/andrei/.cache/uv/built-wheels-v3/.tmp5H3yE1/llama_cpp_python-0.2.87/vendor/llama.cpp/spm-headers/ggml-alloc.h` Caused...