Kaizhi Qian

Results 196 comments of Kaizhi Qian

@qq547276542 Thanks! More follow-up works will be released. Stay tuned.

The pretrained model was trained on only a small set of speakers, so it may not generalize well to other speakers. You can use one-hot embeddings if you are not doing...
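For readers unsure what a one-hot speaker embedding looks like, here is a minimal sketch. It is not taken from the repository: the function name `one_hot_speaker_embeddings` and the speaker IDs are hypothetical, and it only illustrates the general idea of giving each training speaker its own unit vector.

```python
def one_hot_speaker_embeddings(speaker_ids):
    """Map each distinct speaker ID to a one-hot vector (a list of floats).

    Hypothetical helper for illustration only; not part of the released code.
    """
    ordered = sorted(set(speaker_ids))  # fixed, reproducible speaker order
    dim = len(ordered)                  # one dimension per known speaker
    return {
        spk: [1.0 if i == idx else 0.0 for i in range(dim)]
        for idx, spk in enumerate(ordered)
    }


# Example speaker IDs are made up for this sketch.
embeddings = one_hot_speaker_embeddings(["p225", "p226", "p227"])
```

Note that a one-hot vector can only represent speakers that were present when the mapping was built, so this approach does not extend to unseen speakers.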

The provided training data is very small; it is intended for code verification only.

There might be something wrong with your validation data. The validation loss should be around 30.

@c1a1o1 Please clearly state your question and create a new issue. Please do NOT flood other issues.

@jamesliu This looks like over-fitting to me. Make sure you use a large training set and the validation speakers are in the training set.
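One quick way to check the second point is to compare the two speaker lists directly. The sketch below assumes you can list the speaker IDs in each split yourself; the function name and the IDs are hypothetical and not part of the repository.

```python
def unseen_validation_speakers(train_speakers, val_speakers):
    """Return validation speaker IDs that never appear in the training set."""
    return sorted(set(val_speakers) - set(train_speakers))


# Made-up speaker IDs for illustration.
missing = unseen_validation_speakers(
    train_speakers=["p225", "p226", "p227"],
    val_speakers=["p226", "p228"],
)
if missing:
    print("Validation speakers not in the training set:", missing)
```

An empty result means every validation speaker was also seen during training.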

@jamesliu Your training set is actually very small: it has only 30 minutes of data. Also, the "demo data" needs to be consistent with the training data.

There are many contributing factors to output quality. It is hard to tell from the information you provided. @jamesliu

You can use longer audio. There is no limit on the length of input.