DiffSinger icon indicating copy to clipboard operation
DiffSinger copied to clipboard

How to train a model from scratch, with new data set?

Open duheyu opened this issue 3 years ago • 3 comments

The training steps given in the readme.md for DiffSinger require your saved checkpoints and your training data. Can you please indicate how train a model from scratch, with a new data set?

duheyu avatar Feb 08 '22 04:02 duheyu

This repo has supported datasets with various types:

  1. Ljspeech (open-source dataset)
  2. PopCS (our dataset)
  3. Opencpop (dataset from opencpop team) Also, we find that DiffGAN-TTS re-implemented our DiffSpeech on their own multi-spk dataset. We think that people can read and understand our data pipeline, and then train a model with a new dataset, like team DiffGAN-TTS.

MoonInTheRiver avatar Feb 09 '22 15:02 MoonInTheRiver

We've been struggling with this for a while now, without success. Can someone please advise on how to train DiffSinger from scratch?

We have a data set prepared and ready , but don't understand which commands to run to to train without the given checkpoint, and if something needs to be modified in the code. Any help is appreciated, thank you.

duheyu avatar Feb 10 '22 08:02 duheyu

Yes, please add an instruction on how to run the training steps of DiffSinger from scratch. You can use the PopCS dataset as example, just instruct how to start training from the beginning, without any saved checkpoints (creating new ones).

ghost avatar Feb 12 '22 04:02 ghost