vision-language-pretraining topic

List vision-language-pretraining repositories

SGA

37
Stars
2
Forks
Watchers

Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models. [ICCV 2023 Oral]

VALOR

259
Stars
15
Forks
Watchers

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

VideoGPT-plus

197
Stars
16
Forks
Watchers

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

adaptation_robustness

15
Stars
0
Forks
Watchers

Evaluate robustness of adaptation methods on large vision-language models

Janus

16.4k
Stars
2.2k
Forks
148
Watchers

Janus-Series: Unified Multimodal Understanding and Generation Models