djl
djl copied to clipboard
[llama.cpp] Update llama.cpp to latest release b2581 (2024-03-30)
In order to get support for BERT based sentence embedding models like BAAI/bge-base-en-v1.5, mixedbread-ai/mxbai-embed-large-v1, or others, update llama.cpp from b1696 (2023-12-12) to the current latest release b2581 (2024-03-30).
BERT support was added to llama.cpp in February 2024: https://github.com/ggerganov/llama.cpp/pull/5423
This change has not yet been tested. Maybe updating the Gradle property llamacpp_version is not enough and ai_djl_llama.cpp needs to be adapted as well. If so, please do so.
I see. Thanks for taking the time to try it.
The native libraries are built for all platforms with Native S3 llama.cpp, right? I can't find the log of the failed build. Could you please share it?