Dual-Path-RNN-Pytorch icon indicating copy to clipboard operation
Dual-Path-RNN-Pytorch copied to clipboard

Input Normalization

Open JunzheJosephZhu opened this issue 3 years ago • 1 comments

I'm not sure if my mixing is exactly same as yours, but does your torchaudio read the wav files to int(value is typically around a couple hundred) or to float values between [-1, 1]?

I started with scipy which loads to int, and it caused loss going to NaN at a point. So I switched to librosa which loads to float

JunzheJosephZhu avatar Aug 09 '20 02:08 JunzheJosephZhu

float values between [-1, 1]. if you started with scipy which loads to int, you will norm audio between [-1,1]

JusperLee avatar Aug 09 '20 02:08 JusperLee