beats-conformer-bart-audio-captioner icon indicating copy to clipboard operation
beats-conformer-bart-audio-captioner copied to clipboard

PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"

Results 3 beats-conformer-bart-audio-captioner issues
Sort by recently updated
recently updated
newest added

Hi, Shih-Lun Wu, I'm trying to reproduce the model training process implemented in the paper, but i found there's only test process of the model explained in the readme file.

Hi, could you add the training script into the repo?

There is no code related with training, is this code incomplete or there are no plans to open source the training code