vision-language-pretraining topics

awesome-japanese-llm

962

Stars

29

Forks

Watchers

日本語LLMまとめ - Overview of Japanese LLMs

llm-jp

awesome

awesome-list

embeddings

japanese

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high per...

PaddlePaddle

aigc

blip2

clip

coca

FLAIR

58

Stars

6

Forks

Watchers

FLAIR: A Foundation LAnguage-Image model of the Retina for fundus image understanding.

jusiro

foundation-models

fundus-image-analysis

medical-imaging

vision-language-pretraining

DeCLIP

609

Stars

31

Forks

Watchers

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

Sense-GVT

big-model

clip

image-text

multi-model

ptp

144

Stars

4

Forks

Watchers

[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》

sail-sg

cross-modality

vision-language-pretraining

vlp

b2t

25

Stars

0

Forks

Watchers

Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretation

alinlab

bias-and-fairness

explainable-ai

vision-language-pretraining

Multimodality-Representation-Learning

66

Stars

7

Forks

Watchers

This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have been cited and discussed in the survey just accepted https://dl....

marslanm

cross-modal

multimodal-applications

multimodal-datasets

multimodal-deep-learning