SpeechSplit icon indicating copy to clipboard operation
SpeechSplit copied to clipboard

Downsampling for VCTK corpus

Open biggytruck opened this issue 3 years ago • 1 comments

The sampling rate of the VCTK corpus is 48K Hz while the model requires the sampling rate to be 16K Hz. To match the sampling rate, I used librosa's resample function and my code looks like:

import librosa

y, sr = librosa.load(wav_file, sr=48000)
y_16k = librosa.resample(y, sr, 16000)

Is this the same code you used for downsampling the audios? I want to clarify this because I want to make sure the data distribution is the same.

biggytruck avatar Apr 19 '21 02:04 biggytruck

No, but this shouldn't matter.

auspicious3000 avatar Apr 19 '21 02:04 auspicious3000