openl3
Make linear frontend consistent with mel
The linear frequency spectrogram frontend results in 197 frames instead of 199, which seems to be caused by:
- In `_librosa_linear_frontend` (`openl3/core.py`), `center=False` should be `center=True` when calling `librosa.stft`
- In `_construct_linear_audio_network` (`openl3/models.py`), `pad_end=True` should be added to the arguments for `__fix_kapre_spec`
- In `_construct_linear_audio_network` (`openl3/models.py`), in the `else` block corresponding to `if include_frontend`, the `input_shape` should account for centering
In addition to fixing these, we'll likely need to regenerate the regression data.
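For reference, here is a rough sketch of why the centering flag could account for the 197 vs. 199 frame counts. The input length (1 second at 48 kHz), `n_fft=512`, and `hop_length=242` are assumptions, not values stated in this issue; they were picked because they reproduce both numbers. The helper `stft_frame_count` is hypothetical and only mirrors how `librosa.stft` counts frames with and without `center`.

```python
def stft_frame_count(n_samples, n_fft, hop_length, center):
    """Number of STFT frames for a signal of n_samples samples.

    Hypothetical helper mirroring librosa.stft framing:
    - center=False: only full windows that fit inside the signal are kept.
    - center=True: the signal is padded by n_fft // 2 on each side first,
      so (for even n_fft) the window length cancels out of the count.
    """
    if center:
        return 1 + n_samples // hop_length
    return 1 + (n_samples - n_fft) // hop_length


# Assumed values (not stated in the issue): 1 s of audio at 48 kHz,
# a 512-sample window, and a 242-sample hop.
n_samples = 48000
n_fft = 512
hop_length = 242

frames_uncentered = stft_frame_count(n_samples, n_fft, hop_length, center=False)
frames_centered = stft_frame_count(n_samples, n_fft, hop_length, center=True)

print("center=False:", frames_uncentered)
print("center=True: ", frames_centered)
```

Under these assumptions the uncentered count comes out two frames short of the centered one, which matches the discrepancy described above.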