llama-cpp-python
Python bindings for llama.cpp
When I try to install and use this package via a requirements file in the default Python 3.10 container, I get the following error when I try to import the...
Working version (draft) for #74. Trying to find a nice cross-platform method of fetching info simply (and including dep versions), ex: ``` > npx envinfo --system --npmPackages --languages --IDEs...
I'd like to propose a future feature that I think would add useful flexibility for users of the `completions/embeddings` API. I'm suggesting the ability to dynamically load models based on...
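One hedged way to sketch the proposal above: keep a registry of model aliases and load them lazily on first use, so a single server can serve several models. Everything here (the alias names, paths, and `get_model` helper) is hypothetical illustration, not the project's actual API; in the real server the loader would construct a `Llama` instance instead of a dict.

```python
from functools import lru_cache

# Hypothetical alias -> model-path registry (illustration only).
MODEL_PATHS = {
    "small": "/models/ggml-small.bin",
    "large": "/models/ggml-large.bin",
}

@lru_cache(maxsize=2)  # keep at most two models resident at once
def get_model(alias: str) -> dict:
    """Lazily load and cache a model by alias.

    In a real implementation this would return
    Llama(model_path=MODEL_PATHS[alias]); a dict stands in here so the
    sketch runs without a model file.
    """
    return {"alias": alias, "path": MODEL_PATHS[alias]}
```

A request handler could then call `get_model(request.model)` and reuse the cached instance, evicting the least-recently-used model when the cache is full.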
EDIT: I'm running this on an M1 MacBook. Using the model directly works as expected, but running it through Python gives me this output. The `.dylib` binary is built from...
0.1.32 is 2x slower than 0.1.27. I tried using `use_mlock=True`; it warned me about RLIMIT and I had to run `ulimit -l unlimited` temporarily, but it still didn't improve. Is anyone else getting...
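The `ulimit -l` warning above can be checked from Python before enabling `use_mlock`. A minimal sketch, assuming a Unix system where the locked-memory limit is exposed via the stdlib `resource` module (`mlock` fails or warns when the model is larger than this soft limit):

```python
import resource

# Read the current locked-memory rlimit (what `ulimit -l` reports).
soft, hard = resource.getrlimit(resource.RLIMIT_MEMLOCK)

if soft != resource.RLIM_INFINITY:
    print(f"RLIMIT_MEMLOCK soft limit is {soft} bytes; "
          "run `ulimit -l unlimited` before loading with use_mlock=True")
else:
    print("RLIMIT_MEMLOCK is unlimited; use_mlock should work")
```

Raising the limit permanently usually means editing `/etc/security/limits.conf` rather than calling `ulimit` per shell.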
Need to get structured information upfront; feature requests etc. can stay free-form.
The chat completion API, specifically in FastAPI, wasn't doing a very consistent job of completing chats. The results seem to consistently generate gibberish (like `\nA\n/imagine prompt: User is asking about...
When the model wants to output an emoji, this error comes up: `Debugging middleware caught exception in streamed response at a point where response headers were already sent. Traceback (most...
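A likely cause of the emoji failure above is that a multi-byte UTF-8 character gets split across two streamed token chunks, and decoding each chunk independently raises mid-stream, after headers are sent. A minimal sketch of the usual fix, assuming the stream delivers raw UTF-8 bytes: use an incremental decoder that buffers incomplete sequences instead of calling `bytes.decode` per chunk.

```python
import codecs

# A 4-byte emoji (U+1F600) split across two stream chunks, as a model
# streaming one token at a time might produce it.
chunks = [b"\xf0\x9f", b"\x98\x80"]

# The incremental decoder buffers trailing partial sequences between calls,
# so a split code point never raises UnicodeDecodeError mid-stream.
decoder = codecs.getincrementaldecoder("utf-8")(errors="strict")

text = ""
for chunk in chunks:
    text += decoder.decode(chunk)   # yields "" for the incomplete first chunk
text += decoder.decode(b"", final=True)  # flush any buffered bytes

print(text)  # → "😀"
```

By contrast, `chunks[0].decode("utf-8")` would raise immediately, which matches the traceback seen once response headers were already sent.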
I ran `pip install llama-cpp-python` and the installation was a success. Then I created a Python file and copied over the example text in the README. The only change I...