vision-language topic

List of vision-language repositories

NExT-OE

Stars: 25, Forks: 1

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

rewrite

Stars: 18, Forks: 0

[NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation

TrackGPT

Stars: 23, Forks: 0

TrackGPT: Track What You Need in Videos via Text Prompts

TinyLLaVA_Factory

Stars: 604, Forks: 54

A Framework of Small-scale Large Multimodal Models

ProText

Stars: 86, Forks: 4

[CVPRW 2024] Official repository for the paper "Learning to Prompt with Text Only Supervision for Vision-Language Models".

Sambor

Stars: 30, Forks: 0

Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning

MEP-3M

Stars: 20, Forks: 0

🎁 A Large-scale Multi-modal E-Commerce Products Dataset (LTDL@IJCAI-21 Best Dataset & Pattern Recognition 2023)

DeCEMBERT

Stars: 17, Forks: 1

PyTorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)

VLMixer

Stars: 17, Forks: 1

VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix (ICML 2022)