
Change inference speaker to custom dataset speaker

Open urpeter opened this issue 3 years ago • 2 comments

Hello, I fine-tuned the libritts2k model on some custom data of mine (roughly 15 minutes of speech). The output of the inference demo is pretty good, but it doesn't sound like the voice in the custom data. Do I have to fine-tune the model longer? The best results typically appear after 5000 iterations. Or do I have to change some code in the inference.py file? Or do I have a grave misunderstanding of how to produce a custom dataset voice? Any advice would be welcome, thank you.

urpeter avatar Sep 21 '21 22:09 urpeter

@urpeter How did you fine-tune for few-shot synthesis if the libritts2k checkpoint does not contain layers for the current model?

Alexey322 avatar Sep 22 '21 10:09 Alexey322

@urpeter Can you share your config? I have some problems when fine-tuning my model. Thanks.

letrongan avatar Sep 22 '21 10:09 letrongan