image-text topic
ALBEF
Code for ALBEF: a new vision-language pre-training method
Deep-Cross-Modal-Projection-Learning-for-Image-Text-Matching
Deep Cross-Modal Projection Learning for Image-Text Matching
crawlingathome
A client library for LAION's effort to filter CommonCrawl with CLIP, building a large-scale image-text dataset.
glami-1m
The largest multilingual image-text classification dataset. It contains fashion products.
poster-editor
Wrapper for PHP's GD library for easy image manipulation. Supports scaling multi-line text, shapes, filters, and smart resize.
mPLUG
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
DeCLIP
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
QualiCLIP
Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment
imageinwords
Data release for the ImageInWords (IIW) paper.