Dyongh613 comments

Results: 9 comments of Dyongh613

Can I ask you some questions about mel-spectrogram?

Hi @keonlee9420. In my work, I use LJSpeech, and I added the diffusion mechanism to PortaSpeech. The first stage was trained for 160000 steps with a batch size of 64. This spectrogram...

A questions about the output of Phoneme Encoding

Hi @keonlee9420, thank you for your answer!  ------------------ Original Message ------------------ From: "keonlee9420/PortaSpeech" ***@***.***>; Sent: Thursday, September 1, 2022, 10:03 PM ***@***.***>; Cc: "Rui ***@***.******@***.***>; Subject: Re: [keonlee9420/PortaSpeech] A questions about the output of Phoneme Encoding (Issue #27) Hi @qw1260497397...

Who can share the pre-trained model which is the AISHELL3

After training 5000 times with aishell3, an error is reported.
  File "D:\项目\PortaSpeech-main\model\linguistic_encoder.py", line 222, in forward
    duration_w_rounded, src_w_len, mel_mask))
  File "D:\项目\PortaSpeech-main\model\linguistic_encoder.py", line 140, in add_position_enc
    pos_enc = coef.unsqueeze(-1) * pos_enc...

The meaning of inputs[11:] in model.loss.py

Some of the problems that occur in training

RuntimeError: Found dtype Long but expected Float