Decoder part in e2e training using the opencpop dataset

Open Liujingxiu23 opened this issue 3 years ago • 0 comments

In the e2e training mode for opencpop, skip_decoder is true and the decoder is not trained at all, right? But at inference you still call run_decoder to get mel_out and use it as the starting point for q_sample, right? Why can run_decoder still be used here?

Is that why you use k=60 in cascade mode but k=1000 in e2e mode?
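
To make the k question concrete, here is a minimal sketch of the forward-diffusion step behind q_sample. It is not DiffSinger's actual code: the linear beta schedule, the helper names (`linear_alpha_bar`, `signal_weight`), `min_beta`, and the schedule lengths and `max_beta` values used in the usage note are all illustrative assumptions, not values confirmed from the repo configs. It only shows how much of mel_out would survive in the noised starting point at step k.

```python
import math
import random


def linear_alpha_bar(timesteps, max_beta, min_beta=1e-4):
    """Cumulative product of (1 - beta_t) for an ASSUMED linear beta schedule."""
    step = (max_beta - min_beta) / (timesteps - 1)
    alpha_bar, running = [], 1.0
    for i in range(timesteps):
        running *= 1.0 - (min_beta + i * step)
        alpha_bar.append(running)
    return alpha_bar


def q_sample(x0, k, alpha_bar, rng):
    """Noise x0 (a flat list of mel values) to step k, with k counted from 1:
    x_k = sqrt(alpha_bar_k) * x0 + sqrt(1 - alpha_bar_k) * eps."""
    a = alpha_bar[k - 1]
    return [math.sqrt(a) * v + math.sqrt(1.0 - a) * rng.gauss(0.0, 1.0)
            for v in x0]


def signal_weight(timesteps, max_beta, k):
    """Coefficient sqrt(alpha_bar_k) that multiplies x0 (here: mel_out) at step k."""
    return math.sqrt(linear_alpha_bar(timesteps, max_beta)[k - 1])


if __name__ == "__main__":
    # Hypothetical settings, chosen only to contrast a shallow k with a full-length k.
    print("shallow start, k=60 of 100:", signal_weight(100, 0.06, 60))
    print("full start, k=1000 of 1000:", signal_weight(1000, 0.02, 1000))
```

Under these assumed schedules, the weight on mel_out is a sizable fraction of 1 at k=60 but close to zero at k=1000, so with k=1000 the starting point would be almost pure noise whatever run_decoder returns. Whether that is the real reason for the two settings is something only the maintainers can confirm.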

Liujingxiu23 • Sep 22 '22 09:09