anything-llm icon indicating copy to clipboard operation
anything-llm copied to clipboard

Add multi-modal support for image generation (Ollama LLava, GPT4V, DALLE, SD)

Open jameschen83 opened this issue 1 year ago • 4 comments

Ollama support LLaVA model (Image to Text). I am wondering whether AnthingLLM can support to upload an image in chat window and then ask any question on the upload image?

jameschen83 avatar Jan 09 '24 16:01 jameschen83

And Localai supports llava too

lunamidori5 avatar Jan 12 '24 05:01 lunamidori5

This would be great!

Anto79-ops avatar Feb 03 '24 14:02 Anto79-ops

For reference, ollama webui has this feature if it helps. https://github.com/ollama-webui/ollama-webui/issues/220

rquast avatar Feb 14 '24 23:02 rquast

Would be a great feature!

Chammar37 avatar Apr 26 '24 20:04 Chammar37