PitchExtractor
PitchExtractor copied to clipboard
sil loss problem
I trained 3 times with different data, the data includes talking and singing, but the eval sil loss goes up each time. Can you help me with this problem? thank you very much. @yl4579
Have you tried using data_augmentation flag?
Sorry for the late reply. I was pretty busy recently. This is likely due to some miscongfiguration here. The proportion between F0 and silence loss is not well balanced so the silence loss starts to overfit. You may try to set lambda_f0 = 0.01
or even smaller so they are on the same scale, or use a very big dataset like LibriTTS (the dataset that the pre-trained model was trained on).