frob


What does the following return:

```
curl localhost:11434
```

```
Ollama is running
```

ollama is working. If your app doesn't work, that's a problem with the app or your proxy configuration. Try setting:

```
os.environ["no_proxy"] = "127.0.0.1,localhost"
```
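For instance, a minimal Python sketch, assuming your app uses `requests` or another client that honors the proxy environment variables:

```python
import os
import requests  # assumption: your app uses requests or similar

# Exempt local addresses from any proxy; must be set before the request is made
os.environ["no_proxy"] = "127.0.0.1,localhost"

# Same check as `curl localhost:11434`; should print "Ollama is running"
print(requests.get("http://127.0.0.1:11434").text)
```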

Then you need to figure out why your proxy is not routing traffic destined for 127.0.0.1:11434 to 127.0.0.1:11434. ollama doesn't need a proxy; if your app does, then it's not an...
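One way to isolate the problem, sketched with `requests` (an assumption about your client; `trust_env = False` makes a session ignore the proxy environment variables):

```python
import requests

# Direct connection, ignoring HTTP_PROXY/HTTPS_PROXY/NO_PROXY entirely
direct = requests.Session()
direct.trust_env = False
print("direct:", direct.get("http://127.0.0.1:11434").text)  # expect "Ollama is running"

# Connection that honors the environment's proxy settings, like your app does
print("via proxy env:", requests.get("http://127.0.0.1:11434").status_code)
# If "direct" works and this one fails, the proxy configuration is the culprit.
```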

> Does Ollama require users to configure a proxy?

No. If you do have a proxy, you need to configure it to allow clients to connect to the ollama port.

You have no GPU accelerator, and the KunPeng 920 apparently doesn't implement the ARM matrix extensions (SME), so you are relying on brute-force CPU. For LLM inference workloads, it's just...

> Suggestions for how to increase token generation?

Run with GPU acceleration or faster hardware.

You can try running other inference engines on the KunPeng and see if they perform better, and even run them on different hardware platforms as a comparison. That might give...

If your input tokens + output tokens > `num_ctx`, the model will fail due to k-shift. So if you want to use longer prompts (multiple Q&A), you need to increase...
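For example, a sketch of raising `num_ctx` per request through Ollama's REST API (the model name and values here are placeholders):

```python
import requests

resp = requests.post(
    "http://127.0.0.1:11434/api/generate",
    json={
        "model": "llama3",             # placeholder: any model you have pulled
        "prompt": "...your long multi-turn prompt...",
        "stream": False,
        "options": {"num_ctx": 8192},  # must cover input tokens + output tokens
    },
)
print(resp.json()["response"])
```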

Increase `num_ctx` so that it's big enough to hold the input tokens and the output tokens. You still have to control the user's input.
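A rough sketch of that control, assuming a ~4-characters-per-token estimate (a real app would count tokens with the model's own tokenizer):

```python
NUM_CTX = 8192            # must match the num_ctx you run the model with
MAX_OUTPUT_TOKENS = 1024  # headroom reserved for the model's reply

def fits_context(prompt: str) -> bool:
    """Reject prompts that would push input + output past num_ctx."""
    est_prompt_tokens = len(prompt) // 4  # crude heuristic, not a tokenizer
    return est_prompt_tokens + MAX_OUTPUT_TOKENS <= NUM_CTX
```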