Zoli Somogyi comments

Results 93 comments of


                                            Zoli Somogyi

Update LLamaEmbedder, Examples packages, and KernelMemory examples

Test failed: It runs on my computer, I do not see any problem (nothing important has changed compared to before when all tests were OK). Try to restart the test....

Update LLamaEmbedder, Examples packages, and KernelMemory examples

> I've restarted the tests, but I'm not expecting them to pass right now. We seem to have an issue with one specific test at the moment that consistently fails...

Update LLamaEmbedder, Examples packages, and KernelMemory examples

I have corrected several problems in the code and I could make all tests pass (in the macOSX tests with skipping only 1). One of the problems I have corrected...

Update LLamaEmbedder, Examples packages, and KernelMemory examples

> Could you split the csproj file downloading changes to a separate PR? The `SkipUnchangedFiles="true"` attribute should prevent duplicate downloads, so we'll want to look into that. Having it all...

Update LLamaEmbedder, Examples packages, and KernelMemory examples

> Sorry for the delay on this, I'd hoped to get the binary update PR in first but that's taking a long time! Thank you Martin! Yes, I was expecting...

Dramatic increase of the C++ dlls size

Thank you, Martin. I am a bit hesitant because I’m confused about how [ggerganov](https://github.com/ggerganov) still manages to maintain the former file sizes (~60 MB for the DLL), while we are...

Dramatic increase of the C++ dlls size

I believe I've identified the issue. I now have a smaller ggml-cuda.dll (51MB for the Windows Release with one architecture and 157MB with two architectures). The issue seems to stem...

Dramatic increase of the C++ dlls size

> > They are in that build, search for "MB" and that'll highlight the relevant items: > > ``` > ggml-cuda-bin-linux-cublas-cu11.7.1-x64.so : 175MB > ggml-cuda-bin-linux-cublas-cu12.8.0-x64.so : 113MB > ggml-cuda-bin-win-cublas-cu11.7.1-x64.dll :...

Dramatic increase of the C++ dlls size

**IMPORTANT NEWS**: The last master from llama.cpp corrects the BUG with the DLL sizes and also the CUDA graphs problem (https://github.com/ggml-org/llama.cpp/issues/12152) that was causing regular crashes in LLamaSharp. Unfortunately 'llama_model_params'...

[BUG]: Get GGML_ASSERT when running KernelMemorySaveAndLoad.cs

Try to set the Embeddings parameter in ModelParams to true.