Kaizhi Qian
@liveroomand Yes, in our case. But you can design your own speaker encoder or just use a one-hot embedding.
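A minimal sketch of what a one-hot speaker embedding could look like, assuming a fixed speaker list (the speaker names and dimension below are hypothetical, not from the AutoVC code):

```python
# One-hot speaker embedding as a drop-in replacement for a learned
# speaker encoder. Speaker IDs here are illustrative only.
import numpy as np

speakers = ["p225", "p226", "p227"]          # your training speakers
spk2idx = {s: i for i, s in enumerate(speakers)}

def onehot_embedding(speaker, dim=len(speakers)):
    """Return a one-hot vector identifying `speaker`."""
    emb = np.zeros(dim, dtype=np.float32)
    emb[spk2idx[speaker]] = 1.0
    return emb

emb = onehot_embedding("p226")               # e.g. [0., 1., 0.]
```

Note that a one-hot embedding only covers speakers seen during training, so the zero-shot conversion ability of a learned speaker encoder is lost.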
@miaoYuanyuan For other datasets, you need to tune the parameters of the conversion model instead of the parameters of the features.
@miaoYuanyuan If you change the feature parameters, you will need to retrain the WaveNet vocoder as well.
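For illustration, the feature settings that must stay consistent between the conversion model and the vocoder can be kept in one place. The values below follow common AutoVC-style defaults but should be treated as assumptions:

```python
# Both the conversion model's inputs and the vocoder's training data
# are mel-spectrograms computed with the same settings.
FEATURE_PARAMS = dict(
    sr=16000,        # sampling rate
    n_fft=1024,      # FFT size
    hop_length=256,  # frame shift
    n_mels=80,       # mel channels
    fmin=90,         # lowest mel frequency (Hz)
    fmax=7600,       # highest mel frequency (Hz)
)
# If any of these change, regenerate the training features AND
# retrain the WaveNet vocoder on features made with the new values.
```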
Please refer to the data preparation code for details.
You don't need to train them at the same time.
You can simply replace G with P, along with some other minor modifications.
All preprocessing steps are in the code except silence trimming, but I don't think that makes any fundamental difference. Your loss value looks fine.
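For the step not in the repo, a minimal silence-trimming sketch; `librosa.effects.trim`, the 20 dB threshold, and the file name are assumptions, not the authors' exact recipe:

```python
# Trim leading and trailing silence before feature extraction.
import librosa

wav, sr = librosa.load("p225_001.wav", sr=16000)   # hypothetical file
trimmed, _ = librosa.effects.trim(wav, top_db=20)  # drop quiet ends
```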
The train.pkl file is intended for training.
For testing, please refer to issue #108.
.pkl is not a format; it is just a filename suffix. You can name it whatever you like, such as .abc, .qaz, or .wsx. To save...
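A small sketch showing that pickle ignores the suffix entirely (the file name and metadata contents are just for illustration):

```python
# pickle neither checks nor cares about the file extension.
import pickle

metadata = [["p225", [0.1, 0.2], "p225_001.npy"]]  # toy example

with open("train.qaz", "wb") as f:   # any suffix works
    pickle.dump(metadata, f)

with open("train.qaz", "rb") as f:
    loaded = pickle.load(f)
```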