Dyongh613

Results 9 comments of Dyongh613

Hi @keonlee9420. In my work, I use the LJSpeech, and I add the diffusion mechanism to portaspeech. The first stage is tranined by 160000 steps with 64 batch. This spectrogram...

Hi @keonlee9420, thank you for your answer!  ------------------ 原始邮件 ------------------ 发件人: "keonlee9420/PortaSpeech" ***@***.***>; 发送时间: 2022年9月1日(星期四) 晚上10:03 ***@***.***>; 抄送: "Rui ***@***.******@***.***>; 主题: Re: [keonlee9420/PortaSpeech] A questions about the output of Phoneme Encoding (Issue #27) Hi @qw1260497397...

After training 5000 times with aishell3, an error is reported. File "D:\项目\PortaSpeech-main\model\linguistic_encoder.py", line 222, in forward duration_w_rounded, src_w_len, mel_mask)) File "D:\项目\PortaSpeech-main\model\linguistic_encoder.py", line 140, in add_position_enc pos_enc = coef.unsqueeze(-1) * pos_enc...

Thank you for your reply! Just now I modified max_ seq_ len as 1428. Can I ask you some questions later? I'm still in the first year of graduate school...

On inputs[11:], how to decide the value(11). Thank you!

Thank you! I still cannot understand the batch in dataset.py for you said. duration_targets in inputs[3:] is tensor(8,38), while the log_duration_predictions in predictors is tensor(8,19) which got from linguistic_encoder module. ...

log_duration_predictions and duration_roundeds in Predictors , tensor(8,19) , both calculate from Linguistic_encoder. The duration_targets in Inputs[3:] tensor with (8,38). I see there will be some errors when I calculate the...

By the way, it occurs on D_loss and adv_loss

Hi@keonlee9420, this problem has been solved! The previously passed parameters are also of type float when I debug them, but I don't know why this problem occurs. I try to...