language-vision topic

List language-vision repositories

uform

913 stars, 53 forks

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

RLIPv2

96 stars, 3 forks

[ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training