vision-language-model topic

List vision-language-model repositories

menghini-neurips23-code

39
Stars
3
Forks
Watchers

Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.

RobustVLM

52
Stars
3
Forks
Watchers

[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models

LOVM

17
Stars
0
Forks
Watchers

[NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection

DeepSeek-VL

1.7k
Stars
170
Forks
13
Watchers

DeepSeek-VL: Towards Real-World Vision-Language Understanding

awesome-vlm-architectures

164
Stars
11
Forks
Watchers

Famous Vision Language Models and Their Architectures

LMPT

49
Stars
2
Forks
Watchers

LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition

MGM

3.0k
Stars
273
Forks
25
Watchers

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

ViECap

134
Stars
4
Forks
Watchers

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023

PromptKD

123
Stars
1
Forks
Watchers

[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"

A curated list of prompt learning methods for vision-language models.