feat: pass an image as part of the evaluation

Open giladgd opened this issue 2 years ago • 11 comments

This will be implemented once llama.cpp's support for it is stable. Hopefully, there will be an official API for this after https://github.com/ggml-org/llama.cpp/pull/11292 is implemented.

giladgd avatar Nov 05 '23 16:11 giladgd

Interested in this kind of multimodal support. Any update on progress?

samlhuillier avatar Nov 22 '23 22:11 samlhuillier

Does this encompass adding support for running LLaVA models, or should that be a separate feature request? I noticed that llama-cpp-python already includes LLaVA support from llama.cpp, so setting up the bindings shouldn't be too hard.

fozziethebeat avatar Dec 02 '23 03:12 fozziethebeat

I haven't started working on this yet, but it is planned as part of the roadmap. The plan is to add support for llama.cpp's ability to pass an image to a model, which right now only supports LLaVA.

I'll work on this once llama.cpp's API for this is final, to prevent frequent breaking API changes (unlike what happens in some other libraries).

giladgd avatar Dec 03 '23 23:12 giladgd

Makes sense. Hopefully llama.cpp finalizes that API.

fozziethebeat avatar Dec 05 '23 02:12 fozziethebeat