LPCTron
Tacotron samplerate 22050 does not match LPCNet 16000
1. Do you use the same training dataset to train both Tacotron and LPCNet?
I see that you copy the generated f32 files (16000 Hz) to the audio directory created by the T2 preprocessing step.
But your T2 preprocessing used a sample rate of 22050 Hz to generate the audio, mel, and linear outputs.
Does this matter?
There is no need to train LPCNet; you can use the existing model. But I guess it should work better if both are trained on the same dataset. Regarding the sample rate, I guess it should be 16k; you might be right.
I am working on Mandarin synthesis, so I need to train LPCNet from scratch.
Yes, in that case you need to use the same dataset. You need a single PCM file containing the audio samples for training LPCNet.
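For anyone preparing that single PCM file, here is a minimal sketch of one way to build it from a folder of WAV files. It assumes the clips are already mono, 16-bit, and at 16 kHz, which is what I understand LPCNet's training tools to expect; that format and the function name `concat_wavs_to_pcm` are my own assumptions for illustration, not something taken from this repo.

```python
import wave


def concat_wavs_to_pcm(wav_paths, pcm_path, expected_rate=16000):
    """Append the samples of each WAV file to one headerless PCM file.

    Hypothetical helper. Returns the total number of samples written.
    """
    total = 0
    with open(pcm_path, "wb") as out:
        for path in wav_paths:
            with wave.open(path, "rb") as wav:
                fmt = (wav.getnchannels(), wav.getsampwidth(), wav.getframerate())
                # Assumed target format: mono, 16-bit, expected_rate Hz
                if fmt != (1, 2, expected_rate):
                    raise ValueError(
                        f"{path}: expected mono 16-bit audio at {expected_rate} Hz"
                    )
                n = wav.getnframes()
                # readframes returns the raw sample bytes without the WAV header
                out.write(wav.readframes(n))
                total += n
    return total
```

Files at 22050 Hz are rejected rather than silently mixed in, which is the mismatch this issue is about; they would need to be resampled first.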
So, does Tacotron2 still predict a mel spectrogram as the conditioning input for LPCNet? In the LPCNet paper, the 20-dimensional features are not a mel spectrogram. How does it work?
@lyz04551 The features can easily be generated from speech, so we don't need features generated by Tacotron2.
I think we should change hop_size and n_fft if we use 16k audio, since we may need to predict the linear spectrum:
The left is the original Tacotron; the right is this project's.
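To make the hop_size / n_fft point concrete, here is a rough sketch of rescaling frame parameters so they cover the same duration at 16 kHz. The 22050 Hz values below (hop_size 275, win_size 1100) are assumed from a common Tacotron-2 configuration, not read from this repo, and the helper names are hypothetical.

```python
def scale_frame_params(params, src_rate, dst_rate):
    """Scale sample-count parameters so they span the same time at dst_rate."""
    return {name: round(value * dst_rate / src_rate) for name, value in params.items()}


def next_pow2(n):
    """Smallest power of two that is >= n, a usual choice for n_fft."""
    return 1 << (n - 1).bit_length()


# Assumed 22050 Hz settings (illustrative only)
t2_22k = {"hop_size": 275, "win_size": 1100}

t2_16k = scale_frame_params(t2_22k, 22050, 16000)
t2_16k["n_fft"] = next_pow2(t2_16k["win_size"])
print(t2_16k)
```

As far as I know LPCNet works on 10 ms frames, which is 160 samples at 16 kHz, so if Tacotron has to emit one feature frame per LPCNet frame, hop_size may need to be 160 rather than the proportionally scaled value.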
How is the audio quality of the LPCTron you use?
I got bad quality using 10000 samples for training with LPCTron. What about you?
The quality of the speech I synthesized is not very good, and the background always has some harsh sounds.
Can you give me some samples?
Hey, could anyone share a recipe for increasing the sample rate?
@a-froghyar use sox
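In case it helps, a small sketch of what the sox suggestion could look like when scripted over a dataset. It assumes the `sox` binary is installed and on PATH; the file names and the helper names are made up for illustration.

```python
import subprocess


def sox_resample_cmd(src, dst, rate=16000):
    """Build a sox command line that writes src to dst at the given sample rate."""
    # Equivalent to running: sox src -r RATE dst
    return ["sox", src, "-r", str(rate), dst]


def resample(src, dst, rate=16000):
    """Run sox; raises CalledProcessError if sox exits with an error."""
    subprocess.run(sox_resample_cmd(src, dst, rate), check=True)


# Example with hypothetical file names:
# resample("clip_22050.wav", "clip_16000.wav", rate=16000)
```

Note that this only converts the audio files; it does not change the sample rate the models themselves are configured for.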
Hey @alokprasad I meant regarding the training and synthesis within the repo - I'd like to train Tacotron-2 and LPCNet with 24kHz samples.
Sorry, I want to train Tacotron2 + LPCNet, and I do not understand how to train LPCNet. Should I use the features predicted by Tacotron2 or the raw features?
Can you tell me the version for LPCTron?