anything-llm Add multi-modal support for image generation (Ollama LLava, GPT4V, DALLE, SD)

Open jameschen83 opened this issue 1 year ago • 4 comments

Ollama supports the LLaVA model (image to text). Could AnythingLLM support uploading an image in the chat window and then asking questions about the uploaded image?

Jan 09 '24 16:01 jameschen83

LocalAI supports LLaVA too.

Jan 12 '24 05:01 lunamidori5

This would be great!

Feb 03 '24 14:02 Anto79-ops

For reference, Ollama WebUI has this feature, if it helps: https://github.com/ollama-webui/ollama-webui/issues/220

Feb 14 '24 23:02 rquast

Would be a great feature!

Apr 26 '24 20:04 Chammar37
