vision-language topic

List vision-language repositories

VAST

235 Stars

15 Forks

Watchers

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

cross-modality-pretraining

multimodal-foundation-model

VLN-BEVBert

166 Stars

4 Forks

Watchers

[ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"

vision-language

STALE

97 Stars

8 Forks

Watchers

[ECCV 2022] Official PyTorch implementation of the paper "Zero-Shot Temporal Action Detection via Vision-Language Prompting"

action-detection

temporal-action-detection

MCM

59 Stars

7 Forks

Watchers

PyTorch implementation of MCM (Delving into out-of-distribution detection with vision-language representations), NeurIPS 2022

deeplearning-wisc

contrastive-learning

out-of-distribution-detection

representation-learning

vision-language

CG-VLM

21 Stars

1 Fork

Watchers

This is the official repo for Contrastive Vision-Language Alignment Makes Efficient Instruction Learner.

contrastive-learning

data-efficient-learning

instruction-following

PODA

104 Stars

11 Forks

Watchers

[ICCV 2023] Official implementation of "PØDA: Prompt-driven Zero-shot Domain Adaptation"

computer-vision

domain-adaptation

CoTConsistency

29 Stars

1 Fork

Watchers

The released data for the paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models".

chain-of-thought

large-language-models

Clip2Protect

96 Stars

11 Forks

Watchers

[CVPR 2023] Official repository of paper titled "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search".

face-manipulation

face-recognition

RemoteCLIP

277 Stars

18 Forks

Watchers

🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)

contrastive-language-image-pretraining

vision-language

Awesome-RSITR

40 Stars

0 Forks

Watchers

🎮 A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR) | Remote Sensing Cross-Model Retrieval (RSCMR) | Remote Sensing Vision-Language Models (RSVLMs)

cross-model-retrieval

remote-sensing-image-text-retrieval

vision-language