multimodal-representation-learning topic
List
multimodal-representation-learning repositories
RegionSpot
117
Stars
4
Forks
Watchers
Recognize Any Regions
VALOR
259
Stars
15
Forks
Watchers
Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset