vision-and-language-pre-training topic

List of vision-and-language-pre-training repositories

BLIP

4.3k stars, 573 forks

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Awesome_Matching_Pretraining_Transfering

434 stars, 49 forks, 434 watchers

The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insigh...

A curated list of vision-and-language pre-training (VLP). :-)

Chinese-CLIP

3.8k stars, 404 forks

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

SIC-CADS

21 stars, 3 forks

Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)