LPCTron: Tacotron sample rate 22050 does not match LPCNet 16000

1. Do you use the same training dataset to train both Tacotron and LPCNet?

I see that you copy the generated f32 files (16000 Hz) to the audio directory that is generated by the T2 preprocessing.

And your T2 preprocessing used a sample rate of 22050 to generate the audio, mel, and linear outputs.

Does this matter?

May 24 '19 09:05 superhg2012

There is no need to train LPCNet; you can use the existing model. But I guess that if we train with the same dataset it should work better. Regarding the sample rate, I guess it should be 16k; you might be right.

May 24 '19 09:05 alokprasad

I am working on Mandarin synthesis, so I need to train LPCNet from scratch.

May 24 '19 09:05 superhg2012

Yes, in that case you need to use the same dataset. You need a single PCM file containing the audio samples for training LPCNet.
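As a rough illustration of building that single PCM file, here is a minimal sketch using only Python's standard `wave` module. It assumes the source clips are already 16 kHz, mono, 16-bit WAV files and that LPCNet wants headerless PCM in that format; the function name `concat_wavs_to_pcm` and the file names are hypothetical. It does not resample, it only rejects files at another rate (such as 22050 Hz) so the mismatch is caught early.

```python
import wave


def concat_wavs_to_pcm(wav_paths, out_path, expected_rate=16000):
    """Append the raw samples of each WAV file to one headerless PCM file.

    Returns the total number of samples written.
    """
    total_frames = 0
    with open(out_path, "wb") as out:
        for path in wav_paths:
            with wave.open(str(path), "rb") as wav:
                # Refuse files that would silently break the 16 kHz assumption.
                if wav.getframerate() != expected_rate:
                    raise ValueError(
                        f"{path}: {wav.getframerate()} Hz, expected {expected_rate} Hz"
                    )
                # Assumption: training data is mono, 16-bit samples.
                if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
                    raise ValueError(f"{path}: expected mono 16-bit audio")
                n_frames = wav.getnframes()
                out.write(wav.readframes(n_frames))
                total_frames += n_frames
    return total_frames
```

For example, `concat_wavs_to_pcm(["a.wav", "b.wav"], "train.pcm")` would write both clips back to back. Clips recorded at 22050 Hz would have to be resampled with a separate tool first.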

May 24 '19 09:05 alokprasad

So, does Tacotron2 still predict a mel spectrogram as the condition for LPCNet? In the LPCNet paper, the 20-dim features are not a mel spectrogram. How does it work?

May 24 '19 09:05 superhg2012

@lyz04551 The features can easily be generated from speech, so we don't need features generated by Tacotron2.

Aug 02 '19 10:08 alokprasad

I think we should change the hop_size and n_fft if we use 16k audio, since we may need to predict the linear spectrum. [image: linear_spec] The left is the original Tacotron, the right is this project's.
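To make the dependence on sample rate concrete, here is a small sketch that derives hop_size, window size, and n_fft from frame durations in milliseconds. The 12.5 ms hop and 50 ms window are illustrative assumptions, not values confirmed for this project, and `frame_params` is a hypothetical helper.

```python
def frame_params(sample_rate, hop_ms, win_ms):
    """Convert frame durations in milliseconds to sample counts.

    n_fft is chosen as the smallest power of two that covers the window.
    """
    hop_size = int(sample_rate * hop_ms / 1000)
    win_size = int(sample_rate * win_ms / 1000)
    n_fft = 1
    while n_fft < win_size:
        n_fft *= 2
    return hop_size, win_size, n_fft


if __name__ == "__main__":
    # Same durations, two sample rates: the sample counts differ.
    for rate in (22050, 16000):
        print(rate, frame_params(rate, hop_ms=12.5, win_ms=50.0))
```

Keeping the durations fixed and recomputing the sample counts is one way to move a 22050 Hz configuration to 16 kHz. If the vocoder expects 10 ms frames, as I believe LPCNet does, the hop at 16 kHz would be 160 samples instead.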

Aug 23 '19 01:08 lmingde

How is the audio quality of the LPCTron you use?

Aug 23 '19 01:08 lyz04551

I got bad quality using 10000 samples for training with LPCTron. What about you?

Aug 26 '19 07:08 superhg2012

The quality of the speech I synthesized is not very good, and the background always has some harsh sounds.

Aug 26 '19 07:08 lyz04551

Can you give me some samples?