audio-language-pretraining topic
audio-language-pretraining repositories
VALOR
259 Stars, 15 Forks
Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset