vision-language-pretraining topic

List of vision-language-pretraining repositories

LAVIS

9.3k stars, 921 forks

LAVIS - A One-stop Library for Language-Vision Intelligence

Continual-CLIP

103 stars, 6 forks

Official repository for "CLIP model is an Efficient Continual Learner".

protoclip

43 stars, 0 forks

📍 Official PyTorch implementation of the paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)

Video-LLaMA

2.7k stars, 242 forks

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Video-ChatGPT

1.2k stars, 102 forks

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for...

FLM

31 stars, 2 forks

Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)

SegCLIP

78 stars, 8 forks

PyTorch implementation of the ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"

svl_adapter

19 stars, 3 forks

SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models

COSA

38 stars, 2 forks

Code and models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model