tensorflow-wavenet Better silence trimming

Better silence trimming

Open lemonzi opened this issue 9 years ago • 3 comments

The current implementation computes the short-time RMSE amplitude, applies a given threshold on its value (the units are raw amplitude), and trims the audio from the beginning until a frame above the threshold is found, and from the last frame above the threshold until the end. If the result is empty, that file is discarded.

At some point, we might want to implement a better algorithm, such as using a threshold relative to the maximum amplitude in the example or applying a smoothing filter. The algorithm should also be configurable.

Sep 20 '16 23:09 lemonzi

Related: #59.

Sep 21 '16 01:09 lemonzi

Related: https://github.com/librosa/librosa/issues/360

Oct 26 '16 10:10 fehiepsi

Following up on this, any updated functionality that you recommend using?

Jun 01 '18 19:06 PetrochukM

tensorflow-wavenet tensorflow-wavenet copied to clipboard

Better silence trimming

tensorflow-wavenet
tensorflow-wavenet copied to clipboard