audio-language topic

List audio-language repositories

VAST

235
Stars
15
Forks
Watchers

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

ONE-PEACE

942
Stars
59
Forks
Watchers

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

GAMA

67
Stars
6
Forks
Watchers

Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities