glow-tts
glow-tts copied to clipboard
Question about duration loss
Hi, I trained the model with my own dataset. However, the duration loss increases gradually after a period of time for val data, it's overfitting, right? Can I stop the duration prediction after several epochs?
Yeah, also in my experience duration predictor usually overfits quite fast. However beware of your validation set, if you have any abruptly stopped sentences.
Adding onto this question; Did anyone experimented with replacing duration predictor with FastSpeech2's?