Ettore Di Giacinto comments

Results: 650 comments of Ettore Di Giacinto

diffuser backend processes stack up and hog GPU memory

> happens on my AIO image freshly updated from the latest tag @mudler

The latest tag doesn't have the fix yet; only the master images do.

diffuser backend processes stack up and hog GPU memory

I will try to test this locally soon. I was facing https://github.com/mudler/LocalAI/pull/3377 to start with, which made it impractical to test this scenario. Will report back soon-ish.

diffuser backend processes stack up and hog GPU memory

> Any news on this?
> I am facing the same issue with a M60 and Docker (latest cuda12).
> Using different models or jumping from image generation to chat...

Add the new Multi-Modal model of mistral AI: mistral-small-3.1-24b & pixtral-12b

I guess that would already work with llama.cpp GGUF models if/when it gets supported there (see also https://github.com/ggerganov/llama.cpp/issues/9440). I'd change the focus of this one to be...

Add the new Multi-Modal model of mistral AI: mistral-small-3.1-24b & pixtral-12b

See also: https://github.com/ggerganov/llama.cpp/issues/9455

Add the new Multi-Modal model of mistral AI: mistral-small-3.1-24b & pixtral-12b

> BTW: "(Coming very soon) 11B and 90B Vision models
>
> 11B and 90B models support image reasoning use cases, such as document-level understanding including charts and graphs and...

Add the new Multi-Modal model of mistral AI: mistral-small-3.1-24b & pixtral-12b

> It seems they work independently on that [ollama/ollama#6963](https://github.com/ollama/ollama/pull/6963) that looks like only the Golang side of things to fit the images. The real backend changes seem to be in https://github.com/ollama/ollama/pull/6965

After updating to 2.24 LLM hangs after first response

Thanks for raising this. Confirmed, and a patch release is already on its way :+1:

After updating to 2.24 LLM hangs after first response

This should be fixed in 2.24.1.

[factory web ui] add trusted boot support

> IMO, we should not do this. Not handling PRIVATE keys in the web UI is nice, and we cannot be held responsible for losing them.
> Nothing forbids to run the...