MeloTTS Training from Scratch Yielding Unusable Results

Training from Scratch Yielding Unusable Results

Open BankNatchapol opened this issue 10 months ago • 3 comments

Hello. I've been working on training a model from scratch using approximately 300 hours of 22kHz audio data. However, I've encountered some problems. In my language, the phenomizer isn't stable, so I've made modifications to the training script to make it character-based instead. Despite these adjustments, the results of my training have been disappointing; the model only seems to produce random noise.

Below are the losses. If you've got any ideas or tips on how to rescue my poor model from its noisy fate, I'd be incredibly grateful.

Mar 26 '24 12:03 BankNatchapol

MeloTTS MeloTTS copied to clipboard

Training from Scratch Yielding Unusable Results

MeloTTS
MeloTTS copied to clipboard