unsuper_tts_asr Question about DAE loss

Question about DAE loss

Open ky941122 opened this issue 3 years ago • 0 comments

Hi, I have read your paper: Almost Unsupervised Text to Speech and Automatic Speech Recognition, and I really like its idea. I have a little question about the dae loss (L_dae) in it. When calculating L_dae, do you input C(x) to Decoder and calculate the cross attention with Encoder's outputs, then use the Decoder's outputs to calculate the L_dae? Or do you just use the output of Encoder to calculate L_dae and does not involve the Decoder part, just like a traditional MLM task.

Apr 19 '21 10:04 ky941122

unsuper_tts_asr unsuper_tts_asr copied to clipboard

Question about DAE loss

unsuper_tts_asr
unsuper_tts_asr copied to clipboard