Hunter Chasens
I couldn't get it running on Linux with a 7900 XTX; I tried both transformers and llama.cpp.
> A workaround that worked for me (Arch Linux) is to use the system's `libgomp.so` instead of the included one. If you want to try it for yourself:
>
> 1. ...
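The quoted steps are cut off above, so as a general illustration only (my sketch, not the original instructions), forcing a program to pick up the system `libgomp` over a bundled copy usually comes down to `LD_PRELOAD`:

```sh
# Sketch only: preload the system libgomp so it wins over the bundled copy.
# The library path and binary name are assumptions; check your distro's layout.
LD_PRELOAD=/usr/lib/libgomp.so.1 ./the-affected-binary
```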
@brucemacd Any update on this?
Making `cyrus-sasl-xoauth2-git` a dependency might be an issue. If you're not on Arch, then you don't have access to the AUR. Without it, installing `cyrus-sasl-xoauth2-git` requires building it from...
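For anyone not on Arch, the from-source route would look roughly like this (a hedged sketch assuming the upstream `cyrus-sasl-xoauth2` repository and a standard autotools flow; follow the project's README for the real steps):

```sh
# Assumed autotools build of the SASL plugin from source.
git clone https://github.com/moriyoshi/cyrus-sasl-xoauth2.git
cd cyrus-sasl-xoauth2
./autogen.sh
./configure
make
sudo make install
```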
I definitely think OAuth is worth pursuing. I tested your fork on Fedora. Given the difficulty of setting up the dependencies, I found it much easier to just manually configure...
Maybe by editing the `{{json .NetworkSettings.Networks }}` format template?
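For context, that string is a Go template of the kind `docker inspect` accepts through `--format`; a typical invocation (the container name here is a placeholder) looks like:

```sh
# Dump a container's attached networks as JSON via a Go template.
docker inspect --format '{{json .NetworkSettings.Networks}}' my-container
```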
Does this take into account task affinity (model affinity) for model types? Ollama takes time to load a new model, so it makes sense to send requests for the same...
> Ollama has to wait for the upstream llama.cpp backend ([ggerganov/llama.cpp#6849 (comment)](https://github.com/ggerganov/llama.cpp/issues/6849#issuecomment-2072860077)) to support it first.

It just got added upstream. The next Ollama release that pulls from the llama.cpp mainline...
I'm seeing this with my 7900 XTX.
So I figured it out. When using ROCm, it tries to select your first GPU, which is your integrated graphics. There's not enough VRAM, so you get a segmentation fault. ...
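The rest of the comment is truncated; a common workaround in this situation (my assumption, not necessarily what the original went on to say) is to hide the iGPU so ROCm only enumerates the discrete card:

```sh
# Assumed workaround: expose only the discrete GPU to ROCm/HIP.
# Device indices vary per machine; check the ordering with `rocminfo`.
export HIP_VISIBLE_DEVICES=1
ollama serve
```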