vision-language-model topic
menghini-neurips23-code
Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.
RobustVLM
[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
LOVM
[NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection
DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
awesome-vlm-architectures
Famous Vision Language Models and Their Architectures
LMPT
LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition
MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
ViECap
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023
PromptKD
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
Awesome-Prompt-Learning-for-Vision-Language-Models
A curated list of prompt learning methods for vision-language models.