Dyongh613
Dyongh613
Hi @keonlee9420. In my work, I use the LJSpeech, and I add the diffusion mechanism to portaspeech. The first stage is tranined by 160000 steps with 64 batch. This spectrogram...
Hi @keonlee9420, thank you for your answer! ------------------ 原始邮件 ------------------ 发件人: "keonlee9420/PortaSpeech" ***@***.***>; 发送时间: 2022年9月1日(星期四) 晚上10:03 ***@***.***>; 抄送: "Rui ***@***.******@***.***>; 主题: Re: [keonlee9420/PortaSpeech] A questions about the output of Phoneme Encoding (Issue #27) Hi @qw1260497397...
After training 5000 times with aishell3, an error is reported. File "D:\项目\PortaSpeech-main\model\linguistic_encoder.py", line 222, in forward duration_w_rounded, src_w_len, mel_mask)) File "D:\项目\PortaSpeech-main\model\linguistic_encoder.py", line 140, in add_position_enc pos_enc = coef.unsqueeze(-1) * pos_enc...
Thank you for your reply! Just now I modified max_ seq_ len as 1428. Can I ask you some questions later? I'm still in the first year of graduate school...
On inputs[11:], how to decide the value(11). Thank you!
Thank you! I still cannot understand the batch in dataset.py for you said. duration_targets in inputs[3:] is tensor(8,38), while the log_duration_predictions in predictors is tensor(8,19) which got from linguistic_encoder module. ...
log_duration_predictions and duration_roundeds in Predictors , tensor(8,19) , both calculate from Linguistic_encoder. The duration_targets in Inputs[3:] tensor with (8,38). I see there will be some errors when I calculate the...
By the way, it occurs on D_loss and adv_loss
Hi@keonlee9420, this problem has been solved! The previously passed parameters are also of type float when I debug them, but I don't know why this problem occurs. I try to...