multimodal-pretraining topic

List of multimodal-pretraining repositories

Awesome_Matching_Pretraining_Transfering

397 Stars, 47 Forks

A paper list covering large multi-modality models, parameter-efficient finetuning, vision-language pretraining, and conventional image-text matching, intended for preliminary insight.

Emu

1.6k Stars, 85 Forks, 20 Watchers

Emu Series: Generative Multimodal Models from BAAI

mPLUG-2

216 Stars, 17 Forks

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)

Youku-mPLUG

281 Stars, 11 Forks

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks