audio-captioning topic
awesome-sound_event_detection
Reading list for research topics in Sound AI
muscaps
Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)
ClipCap
Using pretrained encoder and language models to generate captions from multimedia inputs.
dcase_2020_T6
2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning-results#wuyusong2020_t6
dcase-2020-baseline
Audio captioning baseline system for DCASE 2020 challenge.
song-describer
Song Describer is a data collection platform for annotating music with textual descriptions.
aac-datasets
Audio Captioning datasets for PyTorch.
sound_ai_progress
Tracking states of the arts and recent results (bibliography) on sound tasks.
fense
Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.
clotho-dataset
Python code for handling the Clotho dataset.