Chinese-FastSpeech2
Fine-tuning issue: speaker count mismatch
I plan to fine-tune the AISHELL3 pretrained model on my own dataset, but my dataset has only 6 speakers, while AISHELL3 has 218. Loading the model fails with a size mismatch error. The Baker dataset has only one speaker, so it doesn't match AISHELL3 either. How did the author deal with this issue?
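I don't know whether this is what the author did, but one workaround I have considered is to drop every checkpoint entry whose shape differs from my model before loading, so that only the speaker embedding is left uninitialized. Below is a minimal sketch of that filtering step. The parameter names (`speaker_emb.weight`, `encoder.weight`), the embedding size 256, and the helper `filter_matching` are my own assumptions for illustration, not taken from the repo, and the tensors are replaced by stand-in objects that only carry a `.shape`.

```python
from types import SimpleNamespace


def filter_matching(checkpoint_state, model_state):
    """Split checkpoint entries into those that fit the target model and those that don't.

    An entry is kept only if the target model has a parameter with the same
    name and the same shape; everything else is reported as skipped.
    """
    kept, skipped = {}, []
    for name, value in checkpoint_state.items():
        target = model_state.get(name)
        if target is not None and tuple(target.shape) == tuple(value.shape):
            kept[name] = value
        else:
            skipped.append(name)
    return kept, skipped


def fake(*shape):
    # Stand-in for a tensor: only the .shape attribute is used above.
    return SimpleNamespace(shape=shape)


# Hypothetical parameter names and sizes; the real keys may differ.
pretrained = {
    "speaker_emb.weight": fake(218, 256),  # 218 AISHELL3 speakers
    "encoder.weight": fake(256, 256),
}
mine = {
    "speaker_emb.weight": fake(6, 256),    # 6 speakers in my dataset
    "encoder.weight": fake(256, 256),
}

kept, skipped = filter_matching(pretrained, mine)
print(skipped)  # ['speaker_emb.weight']
```

With PyTorch, I would pass the checkpoint's state dict and `model.state_dict()` to this function and then call `model.load_state_dict(kept, strict=False)`. As far as I understand, `strict=False` on its own only tolerates missing or unexpected keys and still raises on a size mismatch, which is why the filtering seems necessary. The speaker embedding would then keep its random initialization and be learned during fine-tuning.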