anything-llm
Add multi-modal support for image generation (Ollama LLaVA, GPT4V, DALLE, SD)
Ollama supports the LLaVA model (image to text). I am wondering whether AnythingLLM could support uploading an image in the chat window and then asking questions about the uploaded image?
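For context, here is a rough sketch of what the underlying call to Ollama might look like once an image has been uploaded. It assumes a local Ollama server on its default port with the `llava` model already pulled, and relies on the `images` field of Ollama's `/api/generate` endpoint, which takes base64-encoded images. The helper names `build_llava_request` and `ask_about_image` are hypothetical and only for illustration; this is not how AnythingLLM does or would implement it.

```python
import base64
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_llava_request(image_bytes: bytes, question: str, model: str = "llava") -> dict:
    """Build the JSON body for a one-shot image question (hypothetical helper)."""
    return {
        "model": model,
        "prompt": question,
        # Ollama expects each image as a base64-encoded string.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        # Ask for a single JSON object instead of a stream of chunks.
        "stream": False,
    }


def ask_about_image(image_bytes: bytes, question: str, url: str = OLLAMA_URL) -> str:
    """Send the image and question to Ollama and return the model's answer text."""
    body = json.dumps(build_llava_request(image_bytes, question)).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

With a server running, usage would be something like `ask_about_image(open("photo.png", "rb").read(), "What is in this picture?")`. The chat window would only need to hand the uploaded file's bytes and the user's question to a call like this.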
LocalAI supports LLaVA too.
This would be great!
For reference, Ollama WebUI already has this feature, if it helps: https://github.com/ollama-webui/ollama-webui/issues/220
Would be a great feature!