vision-language-pretraining topic

List vision-language-pretraining repositories

awesome-japanese-llm

962
Stars
29
Forks
Watchers

日本語LLMまとめ - Overview of Japanese LLMs

PaddleMIX

345
Stars
128
Forks
Watchers

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...

FLAIR

58
Stars
6
Forks
Watchers

FLAIR: A Foundation LAnguage-Image model of the Retina for fundus image understanding.

DeCLIP

609
Stars
31
Forks
Watchers

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

ptp

144
Stars
4
Forks
Watchers

[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》

b2t

25
Stars
0
Forks
Watchers

Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation

Multimodality-Representation-Learning

66
Stars
7
Forks
Watchers

This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have been cited and discussed in the survey just accepted https://dl....

DeepSeek-VL

2.0k
Stars
190
Forks
13
Watchers

DeepSeek-VL: Towards Real-World Vision-Language Understanding

BLIText

21
Stars
1
Forks
Watchers

[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training

VLMixer

17
Stars
1
Forks
Watchers

VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix (ICML 2022)