Sihan Chen

Results 3 repositories owned by Sihan Chen

VAST

235
Stars
15
Forks
Watchers

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

COSA

38
Stars
2
Forks
Watchers

Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

VALOR

259
Stars
15
Forks
Watchers

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset