language-vision topic
List
language-vision repositories
uform
913
Stars
53
Forks
Watchers
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
RLIPv2
96
Stars
3
Forks
Watchers
[ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training