multimodal-foundation-model topic

List multimodal-foundation-model repositories

VAST

Stars: 235
Forks: 15

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

MJ-Bench

Stars: 35
Forks: 3

Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"

MADELEINE

Stars: 24
Forks: 4

MADELEINE: multi-stain slide representation learning (ECCV'24)