Patrick Devine comments

Results 323 comments of


                                            Patrick Devine

StableLM-Zephyr incompatible with Ollama version

Thanks for reporting this, @horiacristescu. What type of GPU are you using? I'm assuming if you reinstalled you got the newest version? Do the other quantized versions work? I just...

Ubuntu desktop freezing for a few minutes

Hey @horiacristescu , sorry for the really slow response on this. Are you still seeing the system freezes? I'm wondering if you are deep into swap on your machine? You...

Ubuntu desktop freezing for a few minutes

LMK if this answers each of your questions and if we can close out the issue.

Pull model menifest connect timed out

@AmarkanthJinna did you try the instructions mentioned above? It's also possible that you'll need to get your company to open up the proxy.

phi not working

@morandalex with the Zephyr model it looks like you're running out of memory on the GPU (it looks like the GPU only has 4GB of ram), whereas it seems like...

delete command line history inside ollama

@Ch-i to delete the old one, just `rm ~/.ollama/history`.

loading the model into GPU direct

This is essentially what Ollama does. It tries to offload as many layers of the model as possible into the GPU, and then if there is not enough space, will...

loading the model into GPU direct

Going to go ahead and close out the issue.

A way to update all downloaded models

See #2179

remove client.py

Looks fine. You should probably stick a note in the README and the docs about how to get the official python client.