CLAP Add Data Augmentation

Add Data Augmentation

Open cfoster0 opened this issue 4 years ago • 3 comments

Prioritize fast, GPU-based audio augmentations that maintain the vocal content. I'd like to use nnAudio for the spectogram computation, so augmentations that use PyTorch modules are ideal.

Apr 16 '21 07:04 cfoster0

https://github.com/asteroid-team/torch-audiomentations

Apr 16 '21 07:04 cfoster0

https://github.com/adefossez/julius

Apr 16 '21 07:04 cfoster0

Forgot to mention, we can also think about augmentation for the transcripts. It was mentioned that we could tag then with relevant metadata or run them through https://github.com/huggingface/torchMoji to tag with emotive emojis.

Apr 17 '21 08:04 cfoster0

CLAP CLAP copied to clipboard

Add Data Augmentation

CLAP
CLAP copied to clipboard