vision-and-language topic

List vision-and-language repositories

awesome-vision-language-pretraining-papers

1.1k Stars, 101 Forks

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

ViLT

1.3k Stars, 203 Forks

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

DL-NLP-Readings

849 Stars, 269 Forks

My Reading Lists of Deep Learning and Natural Language Processing

UNITER

766 Stars, 108 Forks

Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"

VL-T5

353 Stars, 58 Forks

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

X-VLM

434 Stars, 52 Forks

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

conceptual-12m

327 Stars, 16 Forks

Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.

awesome-vision-and-language

400 Stars, 33 Forks

A curated list of awesome vision and language resources (still under construction... stay tuned!)

Awesome-Computer-Vision

212 Stars, 40 Forks

Awesome Resources for Advanced Computer Vision Topics

TCL

253 Stars, 33 Forks

Code for "TCL: Vision-Language Pre-Training with Triple Contrastive Learning" (CVPR 2022)