DiffSinger
DiffSinger copied to clipboard
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
According to README-SVS-opencpop-cascade.md, I made my own datasets and tried training. CUDA_VISIBLE_DEVICES=0 python tasks/run.py --config usr/configs/midi/cascade/mydatasets/aux_rel.yaml --exp_name MY_DATASETS_FS_EXP --reset The training went to Epoch 50 but I didn't any checkpoint...
Hello, I have issue as I try to use another english dataset. And I'm wondering why Inference from packed test set can work (`CUDA_VISIBLE_DEVICES=0 python tasks/run.py --config usr/configs/midi/e2e/opencpop/ds100_adj_rel.yaml --exp_name $MY_DS_EXP_NAME...
Hi, with data preview we have create 72 phonemes, is there a way to train the model such that it doesn't use the existing phone_set file with 62 phonemes and...
In the e2e trainning mode of opencpop, skip_decoder is true and the decoder part is not trainned at all, right? But in the inference, you still use run_decoder to get...
hello,is there 22050 sample rate feature trained hifigan models for svs? thanks
hello. Thanks for sharing your great research! I'm going to do an experiment to see the effect of data size on diffsinger model training. So, I want to train using...
Hi, @MoonInTheRiver, thank you for your great work. I have some questions about the training of vocoder as follows: 1. In the training phase, did you use the same loss...
HuggingFace -> Hugging Face
 When I train FFT-Singer from scratch, An AttributeError occur. How do I solve this problem? Thanks.