CLAP icon indicating copy to clipboard operation
CLAP copied to clipboard

Add Data Augmentation

Open cfoster0 opened this issue 4 years ago • 3 comments

Prioritize fast, GPU-based audio augmentations that maintain the vocal content. I'd like to use nnAudio for the spectogram computation, so augmentations that use PyTorch modules are ideal.

cfoster0 avatar Apr 16 '21 07:04 cfoster0

https://github.com/asteroid-team/torch-audiomentations

cfoster0 avatar Apr 16 '21 07:04 cfoster0

https://github.com/adefossez/julius

cfoster0 avatar Apr 16 '21 07:04 cfoster0

Forgot to mention, we can also think about augmentation for the transcripts. It was mentioned that we could tag then with relevant metadata or run them through https://github.com/huggingface/torchMoji to tag with emotive emojis.

cfoster0 avatar Apr 17 '21 08:04 cfoster0