jeremy110
This project doesn't support this. You could refer to https://github.com/litagin02/Style-Bert-VITS2 instead.
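@DAYTOY-1112 CALLHOME is 8 kHz telephone conversation audio, while AISHELL-4 is 16 kHz multi-channel meeting audio, so fine-tuning directly from FS-EEND_ch_90_100epo_avg_model.ckpt will probably cause problems. Also, when I worked with AISHELL-4 using other methods, I extracted a single channel (the third or the fourth, I don't quite remember which), because I found that some channels were not very clean, and merging all of them gave different results. I hope this helps.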
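The single-channel extraction mentioned above could look roughly like the sketch below. It uses only the Python standard library and assumes 16-bit PCM WAV input; the function name `extract_channel` and the zero-based channel index are illustrative choices, not part of FS-EEND or any AISHELL-4 tooling. It does not resample, so the 16 kHz versus 8 kHz mismatch would still need to be handled separately.

```python
import array
import io
import wave


def extract_channel(wav_bytes: bytes, channel: int) -> bytes:
    """Return a mono WAV containing one channel of a multi-channel 16-bit WAV.

    `channel` is zero-based, so the "third channel" is index 2.
    """
    with wave.open(io.BytesIO(wav_bytes), "rb") as src:
        n_channels = src.getnchannels()
        sample_width = src.getsampwidth()
        frame_rate = src.getframerate()
        frames = src.readframes(src.getnframes())

    if sample_width != 2:
        raise ValueError("this sketch only handles 16-bit PCM")
    if not 0 <= channel < n_channels:
        raise ValueError(f"channel must be in [0, {n_channels})")

    # WAV frames are interleaved: ch0, ch1, ..., chN-1, ch0, ch1, ...
    samples = array.array("h")
    samples.frombytes(frames)
    mono = samples[channel::n_channels]

    out = io.BytesIO()
    with wave.open(out, "wb") as dst:
        dst.setnchannels(1)
        dst.setsampwidth(sample_width)
        dst.setframerate(frame_rate)  # sample rate is kept unchanged
        dst.writeframes(mono.tobytes())
    return out.getvalue()
```

For a file on disk, one would read it with `open(path, "rb").read()`, call `extract_channel(data, 2)`, and write the returned bytes to a new `.wav` file. Listening to each channel first is probably still the most reliable way to decide which one is cleanest.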
When you start training, the base model will be downloaded automatically. If you want to train a new language, you can refer to https://github.com/myshell-ai/MeloTTS/issues/120
Doesn't this model already support Korean? Based on my experience, if you want to train a Korean language model with your own data, you can take a pre-trained Korean model...
@Colinsnow1 Sorry, I haven't trained Thai, but I have successfully trained my own language using IPA with pretrained/G.pth. I have posted my log in that issue, so you might need...
Starting training from scratch can be quite challenging as it requires a substantial amount of training data. Additionally, may I ask if you have added new symbols? If so, please...
Could you provide your loss curve and your train.list? 30,000 should be enough to achieve good results.
train.list looks normal. Regarding the losses, g/mel, g/kl, and g/fm are almost the same as mine, but g/total is higher than mine by more than ten. What is your...
Indeed, it has a lot to do with enc_p.emb because I am also unsure how KR's checkpoint was trained. If your method is the same as mine, where the language...
The language I am training is Hokkien, a language spoken in Taiwan. 1. I use Chinese-wwm to fine-tune my own text. 2. There is a G2P to convert text into...