Iver Jordal
Iver Jordal
Could you provide a snippet of code that reproduces the problem? If possible, make one that doesn't need a multi-gpu setup to reproduce it, as I don't have such a...
Ok, but without a code example and a multi-gpu setup I won't be able to reproduce the bug at the moment. Does this bug apply to all transforms, or just...
Thanks for the pull request :) I'll find time to review this
I added VTLP to the demo script and added a speech example there: https://github.com/akashrajkn/torch-audiomentations/pull/1 I listened to the outputs, and it very much resembles a band stop filter. Is that...
data:image/s3,"s3://crabby-images/04704/047046609dc1de7daf5a8182bb36668ee2c056be" alt="vtlp_spectrograms"
I'm trying to learn about what Vocal Tract Length Perturbation is. Reading the paper, I get the idea that it's about frequency warping > For VTLP, we generate a random...
This idea has been implemented in audiomentations recently: https://github.com/iver56/audiomentations/blob/master/audiomentations/augmentations/padding.py To get a deeper understanding of this, it's also useful to run [the demo script](https://github.com/iver56/audiomentations/blob/master/demo/demo.py) in audiomentations and listen to the...
Hi fantasyRgg, and thanks for your PR 😃 Just for context, so I understand the problem you're proposing to solve, I want to ask some questions: * How large is...
Thanks for the insight :) Indeed, in your case it makes sense to apply caching like this. - [x] HDD - [x] Not very large dataset - fits in RAM...
I listened to an output from the demo script now, and I can hear a buzzy sound that was somehow added to my original sound. I did not expect this....