jmtatsch
It's extremely slow on my M1 MacBook (unusable), but quite usable on my 4-year-old i7 workstation. It doesn't work at all on the same workstation inside Docker. Found #767,...
There is a Vicuna model rev1 with some kind of stop fix on 🤗. Maybe that solves your issue?
I run within an Ubuntu container, which works. https://github.com/mkellerman/gpt4all-ui/ runs a Python 3.11 container and it works, so I would guess the issue is not with llama-cpp-python but with your concrete...
Took the opportunity to shrink my own Dockerfile:

```
FROM python:3.10
COPY .devops/requirements.txt requirements.txt
RUN pip install -r requirements.txt && rm -rf requirements.txt
ENTRYPOINT [ "python3", "-m", "llama_cpp.server" ]
```
...
Seems like loading the model already fails. Double-check your model path.
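A minimal sketch of that check via llama-cpp-python's Python API; the model path here is hypothetical, substitute whatever you pass to the server:

```
import os
from llama_cpp import Llama

# Hypothetical model location; replace with your actual path.
model_path = "./models/ggml-vicuna-7b-q4_0.bin"

# Fail early with a clear message instead of a cryptic load error.
if not os.path.exists(model_path):
    raise FileNotFoundError(f"Model file not found: {model_path}")

llm = Llama(model_path=model_path)
```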
Very funny indeed. My Vicuna prefers to answer me in Chinese. With this fix, at least it can do so without erroring out.
I agree the "dummy" caching feature is already really useful. It makes all the difference between me wanting to use it and going to OpenAI instead ;) Regarding a real caching...
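For reference, a minimal sketch of enabling that cache through the Python API (the model path is hypothetical; as I understand it, the cache lets a repeated prompt prefix reuse the already-evaluated state instead of reprocessing every token):

```
from llama_cpp import Llama, LlamaCache

# Hypothetical model path.
llm = Llama(model_path="./models/ggml-vicuna-7b-q4_0.bin")

# Attach an in-memory cache keyed by prompt prefix.
llm.set_cache(LlamaCache())

# The second call with the same prompt should hit the cache.
for _ in range(2):
    out = llm("Q: What is the capital of France? A:", max_tokens=16)
    print(out["choices"][0]["text"])
```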
> I'm thinking of trying to get it to work with my video card, since it is the most high-end part of my PC, but am not quite sure yet...
Maybe I didn't have the patience to really wait it out. However, it wastes a ton of CPU cycles/energy.
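If you do try GPU offloading, a minimal sketch, assuming a llama-cpp-python build compiled with GPU support (e.g. cuBLAS); the layer count and model path are illustrative:

```
from llama_cpp import Llama

# Hypothetical path; n_gpu_layers controls how many transformer
# layers are offloaded to the GPU (0 keeps everything on the CPU).
llm = Llama(
    model_path="./models/ggml-vicuna-7b-q4_0.bin",
    n_gpu_layers=32,
)

print(llm("Hello", max_tokens=8)["choices"][0]["text"])
```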
@dibrale Thanks, that solves the issue for me. However, there are a couple of changes that don't seem necessary to me. Maybe you can explain a bit why you chose...