PitchExtractor

sil loss problem

Open MMMMichaelzhang opened this issue 2 years ago • 2 comments

[Screenshots: 2022-06-19 22-30-27, 2022-07-03 13-01-58]

I trained 3 times with different data (the data includes both talking and singing), but the eval sil loss goes up each time. Can you help me with this problem? Thank you very much. @yl4579

MMMMichaelzhang avatar Jul 03 '22 05:07 MMMMichaelzhang

Have you tried using the data_augmentation flag?

skol101 avatar Jul 08 '22 16:07 skol101

Sorry for the late reply. I was pretty busy recently. This is likely due to some misconfiguration here. The F0 loss and the silence loss are not well balanced, so the silence loss starts to overfit. You may try setting lambda_f0 = 0.01 or even smaller so the two terms are on the same scale, or use a very big dataset like LibriTTS (the dataset that the pre-trained model was trained on).

yl4579 avatar Aug 22 '22 18:08 yl4579
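
For anyone landing here later, the rebalancing suggested above can be sketched roughly as follows. This is only an illustration, not the repository's training code: it assumes the total loss has the form `lambda_f0 * f0_loss + sil_loss`, and the helper names `combined_loss` and `suggest_lambda_f0` as well as the example loss values are hypothetical.

```python
# Minimal sketch, assuming total = lambda_f0 * f0_loss + sil_loss.
# Function names and numbers are hypothetical, for illustration only.

def combined_loss(f0_loss: float, sil_loss: float, lambda_f0: float = 0.01) -> float:
    """Weighted sum of the two terms; lambda_f0 scales only the F0 term."""
    return lambda_f0 * f0_loss + sil_loss


def suggest_lambda_f0(f0_loss: float, sil_loss: float) -> float:
    """Return the weight that puts the scaled F0 term on the same scale as sil_loss."""
    if f0_loss <= 0:
        raise ValueError("f0_loss must be positive to compute a ratio")
    return sil_loss / f0_loss


if __name__ == "__main__":
    # Hypothetical magnitudes read off a training log.
    f0_loss, sil_loss = 50.0, 0.5
    weight = suggest_lambda_f0(f0_loss, sil_loss)
    print("suggested lambda_f0:", weight)
    print("combined loss:", combined_loss(f0_loss, sil_loss, weight))
```

In practice you would read typical magnitudes of the two losses from your own logs, pick a weight in that ballpark, and then check whether the eval sil loss still climbs.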