Andrew Phillips
@manojm-dev Sorry, I hadn't noticed this PR. I would prefer if this didn't change the default port, could you make it use a `PORT` env variable instead?
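A minimal sketch of what the suggestion amounts to, assuming the launcher reads `PORT` at startup and keeps the existing default when it is unset (the fallback value `11434` is illustrative, standing in for whatever the current default is):

```shell
# Hypothetical: honor a PORT override, falling back to the existing default
# instead of changing it for everyone.
PORT="${PORT:-11434}"
echo "listening on port ${PORT}"
```

With this pattern, `PORT=8080 ./run.sh` overrides the port for one invocation and users who set nothing see no change in behavior.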
I guess this is the same problem as in #5022.
@sammcj, I don't think your patch fixes this, but now I have a larger context anyway. This PR seems to indicate there is some double counting going on in the...
@rick-github Good point about `num_gpu` (note that `ollama ps` will still say it's splitting it even if it doesn't), however I think the estimate is also used when deciding to...
Are we sure the `ollama ps` output is in base 10? From my earlier comment:
```
NAME                                    ID            SIZE   PROCESSOR  UNTIL
DEFAULT/mistral-small-2409-22b:latest   d9db479f49e8  24 GB  100% GPU   Forever
```
I...
Sorry, yeah that's painfully obvious now that I'm re-reading it later. You are correct.
FYI, I made a PR to add `ollama ps --base2`: https://github.com/ollama/ollama/pull/8034
```
industrial:~/projects/ollama-src$ ./ollama ps --base2
NAME                                    ID            SIZE      PROCESSOR       UNTIL
DEFAULT/mistral-small-2409-22b:latest   671ad04c21ce  24.4 GiB  7%/93% CPU/GPU  Forever
industrial:~/projects/ollama-src$ nvidia-smi...
> @theasp PR #8034 doesn't fix how Ollama overestimates memory usage and offloads an incorrect number of layers, right?
> Running `ollama ps --base2` just offsets and shows numbers close to...
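Correct: the gap between the two displays is pure unit arithmetic, the same byte count divided by 10^9 (GB) versus 2^30 (GiB). A quick sketch of that conversion (the byte count is illustrative, not taken from a real model):

```shell
# Same number of bytes, rendered in base-10 (GB) and base-2 (GiB) units.
bytes=26200000000   # illustrative model size in bytes
awk -v b="$bytes" 'BEGIN {
  printf "%.1f GB  (base 10: b / 10^9)\n", b / 1e9
  printf "%.1f GiB (base 2:  b / 2^30)\n", b / (1024 ^ 3)
}'
# prints:
# 26.2 GB  (base 10: b / 10^9)
# 24.4 GiB (base 2:  b / 2^30)
```

The roughly 7% difference between the two units is why comparing `ollama ps` against `nvidia-smi` (which reports MiB) looks off unless both are read in the same base.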
Hi Peter, No urgency on my part, take your time. I didn't even notice for over a year!
In my case, I can't run `opencode debug config` without unsetting `http_proxy` and `https_proxy`. If I do unset them, it prints out a valid config.
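One way to do that without disturbing the rest of the shell session is `env -u`, which strips a variable from a single command's environment. A small self-contained demonstration (the proxy URL is made up):

```shell
# env -u removes the named variable from the child command's environment,
# leaving the parent shell's variables untouched.
http_proxy="http://proxy.example:3128" \
  env -u http_proxy sh -c 'echo "http_proxy is ${http_proxy:-unset}"'
# prints: http_proxy is unset
```

Applied here, that would be `env -u http_proxy -u https_proxy opencode debug config`.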