audio Additional spectral feature transformations

Additional spectral feature transformations

Open 0x00b1 opened this issue 7 years ago • 8 comments

While torchaudio provides a Mel-scaled spectrogram transformation (torchaudio.transforms.MEL), there’re a few additional spectral feature transformations that are extremely useful for pre-processing and data augmentation. For example, two feature transformations that I’d love to see in torchaudio are

[x] Mel-frequency cepstral coefficients (MFCC) -- #83
[ ] Chroma Energy Normalized (CENS) chromagrams.

Feb 19 '18 20:02 0x00b1

I might take a stab at MFCCs after my finals (approx a month from now) if you fancy waiting that long :)

Mar 16 '18 01:03 bjuncek

Anyone still looking into this? I would be interested in looking into chroma features if MFCCs are being taken care of 😄

May 11 '18 01:05 kyungyunlee

bjuncek, could you confirm that you are working on the MFCC? I would be willing to work on this if your plans have changed. Thanks

May 18 '18 16:05 MuzzleTheWolf

@MuzzleTheWolf I'm working on it but the progress is slowed down by graduation requirements. If you can get it done before June, it's all yours.

May 19 '18 14:05 bjuncek

Hey did this get anywhere? I'll look into implementing too if not.

Feb 10 '19 00:02 PCerles

I dont think anyone made progress. PRs definitely welcome :)

Feb 11 '19 06:02 soumith

https://github.com/pytorch/audio/pull/83

Feb 11 '19 23:02 PCerles

I am interested in using chroma features within torch audio. Any progress on this? It seems that this issue has been open for a long time without any interaction. If possible, can anyone point to any alternative of pytorch based implementation of chroma?

Apr 17 '23 19:04 pgimeno3

audio audio copied to clipboard

Additional spectral feature transformations

audio
audio copied to clipboard