audio-language-pretraining topic

List audio-language-pretraining repositories

VALOR

259
Stars
15
Forks
Watchers

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset