vision-language-pretraining topic
List
vision-language-pretraining repositories
CVPR2024_MAVL
47
Stars
0
Forks
Watchers
Multi-Aspect Vision Language Pretraining - CVPR2024
SGA
37
Stars
2
Forks
Watchers
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models. [ICCV 2023 Oral]
VALOR
259
Stars
15
Forks
Watchers
Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
VideoGPT-plus
197
Stars
16
Forks
Watchers
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
adaptation_robustness
15
Stars
0
Forks
Watchers
Evaluate robustness of adaptation methods on large vision-language models
Janus
16.4k
Stars
2.2k
Forks
148
Watchers
Janus-Series: Unified Multimodal Understanding and Generation Models