Ettore Di Giacinto
@baditaflorin sounds like you are having installation issues with the NVIDIA container toolkit. Did you follow their docs? https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.html#installing-with-ap
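If the toolkit install went through, NVIDIA's guide suggests verifying it by running `nvidia-smi` inside a throwaway container. A minimal Python sketch wrapping that check (the `ubuntu` image tag is just a convenient default; any CUDA-capable base image works):

```python
import subprocess

def check_nvidia_container_toolkit() -> bool:
    """Run nvidia-smi inside a container to verify the NVIDIA container
    toolkit is wired up, per the verification step in NVIDIA's docs."""
    cmd = ["docker", "run", "--rm", "--gpus", "all", "ubuntu", "nvidia-smi"]
    try:
        result = subprocess.run(cmd, capture_output=True, text=True, timeout=120)
    except (FileNotFoundError, subprocess.TimeoutExpired):
        # docker missing from PATH, or the container hung
        return False
    # Zero exit code plus a GPU table on stdout means the runtime sees the GPU.
    return result.returncode == 0 and "NVIDIA-SMI" in result.stdout

if __name__ == "__main__":
    print("toolkit OK" if check_nvidia_container_toolkit() else "toolkit misconfigured")
```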
> Still not able to do p2p inferencing even if workers are online `v2.25.0 (07655c0c2e0e5fe2bca86339a12237b69d258636)`
>
> server and workers envs
>
> CONTEXT_SIZE: "512"
> THREADS: "4"...
> There you go. let me know if you need anything else
>
> [localai-server.log](https://github.com/user-attachments/files/18795950/localai-server.log) [localai-worker-1.log](https://github.com/user-attachments/files/18795951/localai-worker-1.log)

mmh ok that looks weird: what's the environment? it looks like they can auto-discover...
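When comparing logs like the two attached above, a quick filter for discovery-related lines on both sides makes it easier to see whether the worker ever registered. A small sketch; the keyword list is an assumption, so adjust it to whatever your LocalAI version actually logs:

```python
import re
import sys

# Pull out lines mentioning p2p/discovery so the server and worker logs can
# be compared side by side. The keywords below are a guess, not LocalAI's
# exact log vocabulary -- tweak them to match your logs.
KEYWORDS = re.compile(r"p2p|discover|worker|token", re.IGNORECASE)

def interesting_lines(path: str) -> list[str]:
    with open(path, errors="replace") as fh:
        return [line.rstrip() for line in fh if KEYWORDS.search(line)]

if __name__ == "__main__":
    # Usage: python filter_logs.py localai-server.log localai-worker-1.log
    for log in sys.argv[1:]:
        print(f"== {log} ==")
        for line in interesting_lines(log):
            print(line)
```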
To clarify here @j4ys0n - you are referring to unloading models from a group of federated workers, right? Or are you referring to llama.cpp workers? JFYI we have `/backend/shutdown` for...
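For context, a minimal sketch of calling that endpoint from Python. The `/backend/shutdown` route is mentioned above; the JSON body shape (`{"model": ...}`) and the example model name are assumptions, so verify the request format against your version's LocalAI API reference:

```python
import json
import urllib.request

def shutdown_backend(base_url: str, model: str) -> int:
    """Ask a LocalAI instance to shut down the backend holding `model`.
    Body shape is assumed -- check the LocalAI API docs for your version."""
    req = urllib.request.Request(
        f"{base_url}/backend/shutdown",
        data=json.dumps({"model": model}).encode(),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return resp.status

if __name__ == "__main__":
    # Example: free the backend for a hypothetical model on a local instance.
    print(shutdown_backend("http://localhost:8080", "llama-3.2-1b"))
```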
this should be fixed by https://github.com/mudler/LocalAI/pull/3789
oh nice! that's cool! maybe we can close this already, or do you want to keep it open until we have an e2e working example?
This is looking nice, thank you! just a few small nits
This is going to be a lot more interesting with https://github.com/kairos-io/kairos/issues/3244
https://github.com/kairos-io/kairos/issues/3606
This issue will be covered by #3388