It sounds like a simple terminal frontend would do what you need; see [Terminal](https://github.com/ollama/ollama?tab=readme-ov-file#terminal) on the integrations page.
If it's an embedding issue, it might be #7288: if the chunk size for the embed is larger than the context window, it causes problems.
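As a rough client-side guard, you can split text into chunks that stay safely below the model's context window before embedding. A minimal sketch, with word count used as a crude proxy for tokens (the actual tokenizer may count differently, so leave headroom; the `max_tokens` and `words_per_token` values are assumptions, not defaults from ollama):

```python
def chunk_text(text: str, max_tokens: int = 2048,
               words_per_token: float = 0.75) -> list[str]:
    """Split text into chunks that stay under a token budget.

    Word count is a crude proxy for tokens (~0.75 words per token
    for English); the real tokenizer may differ, so keep headroom
    below the model's actual context window.
    """
    max_words = int(max_tokens * words_per_token)
    words = text.split()
    return [" ".join(words[i:i + max_words])
            for i in range(0, len(words), max_words)]
```

Each chunk can then be sent to the embedding endpoint individually instead of embedding the whole document at once.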
The likely problem is that the client has a timeout and has sent so many embedding requests that ollama can't respond before the client times out and closes the connection.
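One mitigation on the client side is to cap the number of in-flight requests (and raise the client timeout) instead of firing every embedding request at once. A sketch under that assumption, with `embed_fn` standing in for whatever call your client makes to ollama:

```python
from concurrent.futures import ThreadPoolExecutor

def embed_all(texts, embed_fn, max_in_flight=2):
    """Run embed_fn over texts with at most `max_in_flight`
    concurrent requests, so the server isn't flooded faster
    than it can respond. Results come back in input order."""
    with ThreadPoolExecutor(max_workers=max_in_flight) as pool:
        return list(pool.map(embed_fn, texts))
```

With a low `max_in_flight`, each request gets a response well before a typical client timeout, at the cost of lower throughput.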
[Server log](https://docs.ollama.com/troubleshooting) may help in debugging.
```
time=2025-11-07T14:28:07.694+01:00 level=INFO source=server.go:653 msg="loading model" "model layers"=49 requested=1
```

@ComplexPlaneDev Have you set `num_gpu` for this model?
The [parameters](https://ollama.com/jobautomation/OpenEuroLLM-Italian:latest/blobs/3d0216c791fa) for this model explicitly limit the GPU layer count to 1, which accounts for the slowness. This should also have been the case in previous ollama versions. Since...
The code that processes the parameters is fairly device-independent, so I think it's unlikely, but I don't have access to a Mac so I can't test.
It could be that the client was overriding the value for `num_gpu` and that has changed. What client are you using?
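For reference, a client can override `num_gpu` per request via the `options` field of the API (e.g. on `/api/generate`), which would mask a restrictive value baked into the model's parameters. A sketch that only builds the request body — the model name is a placeholder, and the layer count here just matches the `"model layers"=49` value from the log above:

```python
def build_generate_payload(model: str, prompt: str, num_gpu: int) -> dict:
    """Build an /api/generate request body whose options field
    overrides the model's num_gpu parameter for this request only."""
    return {
        "model": model,
        "prompt": prompt,
        "options": {"num_gpu": num_gpu},
    }
```

If your client sends something like this, its `options.num_gpu` takes precedence over the value in the model's parameter blob.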