Hagen Hübel comments

Results 65 comments of


                                            Hagen Hübel

trafficstars

Hangs after 20-30 mins, a perdiocal restart of the ollama service is required

I've conducted extensive testing today and encountered an issue with the Ollama model while running a FastAPI-based Python API on a GPU machine (RTX 4000 from Hetzner). Here are the...

Hangs after 20-30 mins, a perdiocal restart of the ollama service is required

Btw, I found also out, that the invokation of `ollama.chat` is blocking the full python process. I can not even call another endpoint during that time, not even the "/docs"-endpoint...

CUDA not supported. `ValueError: Attempt to split tensors that exceed maximum supported devices. Current LLAMA_MAX_DEVICES=1`

I was running into the same, reported here https://github.com/abetlen/llama-cpp-python/issues/1693

CUDA unsupported, installation bypasses CUDA at all

For the sake of completeness: if someone is looking for a llama.cpp binding that works with CUDA support, no matter which underlying programming language, I can recommend the NodeJS bindings:...

Permission errors when restoring database backups in local development

thx, @sweatybridge > The permission issues are due to old versions of postgres. In order to fix them, you need to do a `db dump` locally after restoring from backup....