audio-language topic
List
audio-language repositories
VAST
235
Stars
15
Forks
Watchers
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
ONE-PEACE
942
Stars
59
Forks
Watchers
A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
GAMA
67
Stars
6
Forks
Watchers
Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
AudioLLM
66
Stars
3
Forks
Watchers
Audio Large Language Models