multimodal-foundation-model topic

List multimodal-foundation-model repositories

VAST

235
Stars
15
Forks
Watchers

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

MJ-Bench

49
Stars
5
Forks
49
Watchers

Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"

MADELEINE

64
Stars
6
Forks
64
Watchers

MADELEINE: multi-stain slide representation learning (ECCV'24)