kapre icon indicating copy to clipboard operation
kapre copied to clipboard

TODO - 2020 version

Open keunwoochoi opened this issue 3 years ago • 10 comments

  • [x] Add inverse STFT
  • [x] Add a composed layer that concatenates magnitude and phase of input complex number (which would be STFT)

keunwoochoi avatar Aug 15 '20 04:08 keunwoochoi

  • [x] mfcc?
  • [ ] time-stretch?
  • [ ] frequency shift?

keunwoochoi avatar Aug 18 '20 08:08 keunwoochoi

  • [x] fix test for phase

keunwoochoi avatar Aug 18 '20 17:08 keunwoochoi

  • [x] energy
  • [ ] loudness
  • [ ] HPSS

keunwoochoi avatar Aug 24 '20 19:08 keunwoochoi

  • [x] tf.signal.frame
  • [x] mu law
  • [ ] pcen
  • [ ] freq and time mask (spec aug)
  • [x] augmentation - swap channels
  • [ ] augmentation - one for my ldb
  • [ ] compute loudness like https://github.com/magenta/ddsp/blob/0c8c151cbddefb32d04c1b017f060ac44997d036/ddsp/spectral_ops.py#L173, which involves a-weighting (so we need backend for it)

keunwoochoi avatar Aug 25 '20 01:08 keunwoochoi

  • [x] frequency-aware conv layer? (@kkoutini any chance you'd be interested in making a PR?)

keunwoochoi avatar Aug 26 '20 16:08 keunwoochoi

  • [ ] InstaneousFrequency(pad_end=True)
  • [ ] pre-emphasis, de-emphasis
  • [ ] LPC
  • [ ] ERB
  • [ ] LFCC

keunwoochoi avatar Aug 28 '20 19:08 keunwoochoi

  • [x] change numpy requirements to 1.18.5 to be compatible with tensorflow.
  • [x] enable model name for composed layers

keunwoochoi avatar Sep 01 '20 00:09 keunwoochoi

typos .. retunrs

keunwoochoi avatar Sep 09 '20 04:09 keunwoochoi

  • [ ] random gain (control the multiplication factor)
  • [ ] random volume (control the output.. amplitude? energy?)

keunwoochoi avatar Sep 15 '20 17:09 keunwoochoi

  • [ ] more examples

  • [ ] pre-structured models?

keunwoochoi avatar Sep 29 '20 16:09 keunwoochoi