How to make 'mfcc_stats.pkl' and 'spk2emb_82.pkl'?

Open insunhwang89 opened this issue 4 years ago • 3 comments

How to make 'mfcc_stats.pkl' and 'spk2emb_82.pkl'?

I want to test another dataset.

Thanks.

insunhwang89 avatar Aug 27 '21 07:08 insunhwang89

mfcc_stats.pkl holds the mean and standard deviation of the MFCCs.

spk2emb_82.pkl is a mapping from each speaker name to a one-hot embedding.

auspicious3000 avatar Aug 27 '21 14:08 auspicious3000
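
For readers who want to build equivalent files for another dataset, the reply above can be sketched roughly as follows. This is only a sketch under assumptions, not the repository's own preprocessing: the helper names, the toy speaker IDs, the sorted speaker order, and the choice to pool statistics over all utterances (rather than per speaker) are all hypothetical. The exact object layout the training code expects inside each pickle should be checked against how the repository loads these files.

```python
import pickle
import numpy as np


def make_spk2emb(speaker_names):
    """Map each speaker name to a one-hot vector.

    Assumption: speakers are indexed in sorted order.
    """
    names = sorted(set(speaker_names))
    eye = np.eye(len(names), dtype=np.float32)
    return {name: eye[i] for i, name in enumerate(names)}


def mfcc_mean_std(mfcc_list):
    """Per-coefficient mean and std over all frames of all utterances.

    Each item in mfcc_list is an array of shape (n_frames, n_coeffs).
    Assumption: statistics are pooled globally, not per speaker.
    """
    frames = np.concatenate(mfcc_list, axis=0)
    return frames.mean(axis=0), frames.std(axis=0)


def save_pickle(obj, path):
    """Write obj to path with pickle (helper for illustration only)."""
    with open(path, "wb") as f:
        pickle.dump(obj, f)


# Hypothetical toy inputs standing in for a real dataset.
speakers = ["p225", "p226", "p227"]
rng = np.random.default_rng(0)
mfccs = [rng.normal(size=(120, 20)), rng.normal(size=(95, 20))]

spk2emb = make_spk2emb(speakers)
mean, std = mfcc_mean_std(mfccs)
```

Calling save_pickle(spk2emb, "spk2emb_custom.pkl") would then write the mapping to disk; the file name here is illustrative.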

The mfcc_stats.pkl file also contains something called 'dctmx'.

Is it correct to find 'dctmx' with the function below?

from scipy import fft
dctmx = fft.dct(wav)

And does dctmx also use the average value of each speaker?

Thanks for your reply.

insunhwang89 avatar Aug 27 '21 14:08 insunhwang89

dctmx = scipy.fftpack.dct(np.eye(80), type=2, axis=1, norm='ortho')

You can just reuse the dctmx stored in mfcc_stats.pkl if you use the same spectrogram specifications.

auspicious3000 avatar Aug 27 '21 15:08 auspicious3000
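
A short check may help show what this matrix is. Because it is built from np.eye(80) alone, it depends only on the number of frequency bins, not on any waveform or speaker, so there is nothing to average. Multiplying a spectrogram by it gives the same result as taking the orthonormal type-2 DCT along the frequency axis. In the sketch below, the random log_mel array is a hypothetical stand-in for a real 80-bin spectrogram, and the matrix product is just one way the matrix could be applied, not necessarily how the repository applies it.

```python
import numpy as np
from scipy.fftpack import dct

n_mels = 80  # must match the number of spectrogram bins

# DCT basis matrix: row i is the DCT of the i-th unit vector.
dctmx = dct(np.eye(n_mels), type=2, axis=1, norm='ortho')

# Hypothetical stand-in for a log-mel spectrogram, shape (n_frames, n_mels).
rng = np.random.default_rng(0)
log_mel = rng.normal(size=(50, n_mels))

# By linearity, a matrix product with dctmx equals a DCT along the bin axis.
coeffs_via_matrix = log_mel @ dctmx
coeffs_direct = dct(log_mel, type=2, axis=1, norm='ortho')

same_result = np.allclose(coeffs_via_matrix, coeffs_direct)

# With norm='ortho' the matrix is orthogonal, so its transpose inverts it.
is_orthogonal = np.allclose(dctmx @ dctmx.T, np.eye(n_mels))
```

This also explains why fft.dct(wav) is not the same thing: that call transforms the raw waveform, whereas dctmx is a fixed 80 x 80 basis applied to spectrogram frames.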