My advice was based on that specific file. A different file will require different corrective measures.
I see this when the server is behind a proxy. The proxy disconnects connections that have been idle for a while (60 seconds in my case). I'm unable to influence...
What is inconsistent?
> Don't split models at all unless you need to, ollama already does this.
> and when you do need to split, split in this order of cards: 3060 1,...
If the model is being distributed across multiple devices, ollama thinks it doesn't fit in one GPU. Look at the logs for lines with `source=sched.go`; they will show the decisions...
Note that if you have set `OLLAMA_SCHED_SPREAD=1`, ollama will always try to spread the model.
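For reference, a quick way to check this on a systemd-based Linux install (assuming the default `ollama` service unit; adjust for your setup):

```shell
# Ask the scheduler to spread the model across all GPUs
# (set this in the server's environment, then restart it):
OLLAMA_SCHED_SPREAD=1 ollama serve

# Inspect the scheduler's placement decisions in the server log:
journalctl -u ollama | grep 'source=sched.go'
```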
ollama runs [needsReload](https://github.com/ollama/ollama/blob/e9e9bdb8d904f009e8b1e54af9f77624d481cfb2/server/sched.go#L574) before each request. It includes a check for changes in the parameters passed to the llama server, so if the rpc backends change, that should cause a model reload.
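As an illustration, two requests that differ only in a llama-server parameter (here `num_gpu`; the model name is just an example) should cause a reload between them:

```shell
curl http://localhost:11434/api/generate \
  -d '{"model": "llama3", "prompt": "hi", "options": {"num_gpu": 20}}'

# Changing num_gpu between requests changes the llama server's
# parameters, so needsReload should trigger a reload here:
curl http://localhost:11434/api/generate \
  -d '{"model": "llama3", "prompt": "hi", "options": {"num_gpu": 40}}'
```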
You can force ollama to use specific runners by setting `OLLAMA_LLM_LIBRARY` in the server environment, e.g. `OLLAMA_LLM_LIBRARY=cpu_avx2`.
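For example (the available library names depend on your build; the server log should list them at startup):

```shell
# Force the CPU AVX2 runner even if a GPU is detected:
OLLAMA_LLM_LIBRARY=cpu_avx2 ollama serve
```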
Not sure I understand the question. ollama starts a runner per model; the hardware available normally dictates which runner is used - if CUDA is available, the cuda runner is...
I understand: you want to maximize performance when ollama can't offload all layers to the GPU. I did some tests and I see what you mean; when the cuda runner...
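In the meantime, a rough sketch of a workaround, assuming your build honours the standard `num_gpu` and `num_thread` request options (the values here are placeholders to tune for your hardware):

```shell
# Pin the number of layers offloaded to the GPU and the CPU
# threads used for the layers that stay on the CPU:
curl http://localhost:11434/api/generate \
  -d '{"model": "llama3", "prompt": "hi", "options": {"num_gpu": 25, "num_thread": 8}}'
```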