DiffSinger
DiffSinger copied to clipboard
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
I think this "DiffSinger" model is based on Chinese. Please give me advice on how to train them in another language. Thank you for share!
Thank you for releasing the POPCS dataset, I was wondering the phoneme duration label is automatically labeled by MFA tool or labeled by human?
By enable the "with_spk_embed" option, and then retaining the model, can it support multi speaker singing?
How to do the DiffSinge test, why does my program report no errors but no audio files are generated?
How to do the DiffSinge test, why does my program report no errors but no audio files are generated?
Hi, I noticed that different versions of parselmouth would result in different length of the computed f0. This is also mentioned in the comment of your code. https://github.com/MoonInTheRiver/DiffSinger/blob/ae3e8f05e04ea28bd6d68ff8eb2a6ae882f7d9c0/data_gen/tts/data_gen_utils.py#L176 I think...
Thank you very much for providing PopCS for free~! When I reading your paper, I noticed you `re-trained a Montreal Forced Aligner tool` to build the dataset PopCS. Would you...
Hi, thanks for the great work! I want to inference on my own files. I generated the corresponding meta.json and tried to binarize it. But the binarizer can only generate...
Could you please give some instructions? Thank you very much
The training steps given in the readme.md for DiffSinger require your saved checkpoints and your training data. Can you please indicate how train a model from scratch, with a new...
I will give you the link within seven days. If not, email me again.