Patrick Devine
@mindspawn Can you attach the logs and the Modelfile? Also, do you have a link to the gguf binary, or did you convert it yourself?
I just tried it and everything in `0.1.38` seems to be working just fine. `OLLAMA_HOST=0.0.0.0:11434 ollama serve` and then on a separate host:

```
OLLAMA_HOST=x.x.x.x ollama run llama3
>>> hi...
```
What's the output of `ollama ps`?
@x66ccff can you try updating to ollama `0.1.38`?
@15731807423 what's the output of `ollama ps`? It should tell you how much of the model is on the GPU and how much is on the CPU.
@15731807423 looks like 70b is being partially offloaded, and 8b is fully running on the GPU. When you do `/set verbose` how many tokens / second are you getting? With...
It is using the GPU, but it's not particularly *efficient* at using it, because the model is split across the CPU and GPU, and because of the limitations of the computer (like...
cc @BruceMacD
Can you post the Modelfile and the logs? What was the gguf you were using?
Can you include the Modelfile as well?