audio icon indicating copy to clipboard operation
audio copied to clipboard

Additional spectral feature transformations

Open 0x00b1 opened this issue 7 years ago • 8 comments

While torchaudio provides a Mel-scaled spectrogram transformation (torchaudio.transforms.MEL), there’re a few additional spectral feature transformations that are extremely useful for pre-processing and data augmentation. For example, two feature transformations that I’d love to see in torchaudio are

0x00b1 avatar Feb 19 '18 20:02 0x00b1

I might take a stab at MFCCs after my finals (approx a month from now) if you fancy waiting that long :)

bjuncek avatar Mar 16 '18 01:03 bjuncek

Anyone still looking into this? I would be interested in looking into chroma features if MFCCs are being taken care of 😄

kyungyunlee avatar May 11 '18 01:05 kyungyunlee

bjuncek, could you confirm that you are working on the MFCC? I would be willing to work on this if your plans have changed. Thanks

MuzzleTheWolf avatar May 18 '18 16:05 MuzzleTheWolf

@MuzzleTheWolf I'm working on it but the progress is slowed down by graduation requirements. If you can get it done before June, it's all yours.

bjuncek avatar May 19 '18 14:05 bjuncek

Hey did this get anywhere? I'll look into implementing too if not.

PCerles avatar Feb 10 '19 00:02 PCerles

I dont think anyone made progress. PRs definitely welcome :)

soumith avatar Feb 11 '19 06:02 soumith

https://github.com/pytorch/audio/pull/83

PCerles avatar Feb 11 '19 23:02 PCerles

I am interested in using chroma features within torch audio. Any progress on this? It seems that this issue has been open for a long time without any interaction. If possible, can anyone point to any alternative of pytorch based implementation of chroma?

pgimeno3 avatar Apr 17 '23 19:04 pgimeno3