frob

Results: 843 comments by frob

Vision support for llama3.2. llama3.2 doesn't do video, and [doesn't work reliably with multiple images](https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct/discussions/43#:~:text=image%20for%20inferencing%2C-,the%20model%20doesn%27t%20work%20reliably%20well%20with%20multiple%20images,-JOJOHuang).

Version [0.4.0](https://github.com/ollama/ollama/releases/tag/v0.4.0-rc3) will support llama3.2-vision.

It should run on GPU if it fits:
```console
$ ollama ps
NAME                        ID              SIZE     PROCESSOR    UNTIL
x/llama3.2-vision:latest    25e973636a29    11 GB    100% GPU     Forever
```
If you can provide [server...
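When several models are loaded, the same check can be scripted. The following is a minimal sketch, not part of the original comment: `not_fully_on_gpu` is a hypothetical helper name, and it assumes only that a model fully resident on the GPU shows `100% GPU` in the `PROCESSOR` column, as in the listing above.

```shell
# Hypothetical helper: read `ollama ps` output on stdin and print the name of
# every loaded model whose row does not contain "100% GPU".
# NR > 1 skips the header line; NF > 0 skips blank lines.
not_fully_on_gpu() {
  awk 'NR > 1 && NF > 0 && $0 !~ /100% GPU/ { print $1 }'
}

# Usage (assumes the ollama CLI is installed and a server is running):
#   ollama ps | not_fully_on_gpu
```

An empty result means every loaded model reports `100% GPU`.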

Yep, too big for your card.

0.3.14 cannot load x/llama3.2-vision.

If the image is large, it will exceed the maximum argument length of the shell.
```sh
(echo '{
  "model":"x/llama3.2-vision",
  "messages":[
    {
      "role":"user",
      "content":"describe this image",
      "images":["' ; curl -s https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg...
```
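The general idea is to stream the request body through a pipe so that the base64 data never appears as a command-line argument. The sketch below illustrates that idea and does not reconstruct the truncated command above: `build_payload` is a hypothetical helper name, `photo.jpg` is a placeholder path, and the request fields (`stream`, `messages`, `images`) assume the standard Ollama `/api/chat` request shape.

```shell
# Hypothetical helper: write a chat request body to stdout.
# The image is base64-encoded from stdin and emitted as part of the stream,
# so its size is not limited by the shell's maximum argument length.
build_payload() {
  image_path=$1
  printf '%s' '{"model":"x/llama3.2-vision","stream":false,"messages":[{"role":"user","content":"describe this image","images":["'
  # tr strips the line wrapping that some base64 implementations add.
  base64 < "$image_path" | tr -d '\n'
  printf '%s\n' '"]}]}'
}

# Usage (assumes a local Ollama server on the default port):
#   build_payload photo.jpg | curl -s http://localhost:11434/api/chat -d @-
```

Here `-d @-` tells curl to read the request body from standard input, so only the short fixed arguments are passed on the command line.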

[Server logs](https://github.com/ollama/ollama/blob/main/docs/troubleshooting.md#how-to-troubleshoot-issues) will help in debugging.
```console
$ curl localhost:11434/api/version
{"version":"0.4.0-rc8"}
$ (echo '{
  "model":"x/llama3.2-vision",
  "messages":[
    {
      "role":"user",
      "content":"describe this image",
      "images":["' ; curl -s https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg | base64 -w0 ;...
```

> Is it because of the large num_ctx value?

Yes. https://github.com/ollama/ollama/issues/7288#issuecomment-2591709109

I am unable to replicate:
```console
$ curl -s http://localhost:11434/api/version
{"version":"0.3.14"}
$ curl -s localhost:11434/api/embed -d '{"model":"viosay/conan-embedding-v1","input":"Why is the sky blue?"}' | jq '.embeddings=[.embeddings[]|length]'
{
  "model": "viosay/conan-embedding-v1",
  "embeddings": [
    1024...
```