
[llama.cpp] Update llama.cpp to latest release b2581 (2024-03-30)

Open howlger opened this issue 1 year ago • 1 comments

To support BERT-based sentence embedding models such as BAAI/bge-base-en-v1.5 or mixedbread-ai/mxbai-embed-large-v1, update llama.cpp from b1696 (2023-12-12) to the latest release, b2581 (2024-03-30).

BERT support was added to llama.cpp in February 2024: https://github.com/ggerganov/llama.cpp/pull/5423

This change has not been tested yet. Updating the Gradle property llamacpp_version alone may not be enough; ai_djl_llama.cpp may need to be adapted as well. If so, please make the necessary changes.

howlger, Apr 01 '24 11:04

I see. Thanks for taking the time to try it.

The native libraries are built for all platforms with Native S3 llama.cpp, right? I can't find the log of the failed build. Could you please share it?

howlger, Apr 02 '24 08:04