llm
llm copied to clipboard
Multi-modal support for vision models such as GPT-4 vision
https://platform.openai.com/docs/guides/vision
I think this is best handled by command line options --image
and --image-urls
to either encode and pass as base64, or to pass a URL.