vision-language-pretraining topic

List vision-language-pretraining repositories

LAVIS

8.9k
Stars
887
Forks
69
Watchers

LAVIS - A One-stop Library for Language-Vision Intelligence

Continual-CLIP

68
Stars
2
Forks

Official repository for "CLIP model is an Efficient Continual Learner".

protoclip

43
Stars
0
Forks

📍 Official PyTorch implementation of the paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)

Video-LLaMA

2.5k
Stars
229
Forks
15
Watchers

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Video-ChatGPT

983
Stars
87
Forks

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for...

FLM

31
Stars
2
Forks

Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)

SegCLIP

74
Stars
8
Forks

PyTorch implementation of the ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"

svl_adapter

19
Stars
3
Forks

SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models

COSA

36
Stars
1
Forks

Code and models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model