Rishikesh (ऋषिकेश)
> > detached hidden state
>
> @rishikksh20 Does this refer to text encoder output?

Yes.
@v-nhandt21 Yes, the Conformer in TTS is a modified version of the ASR one.
OK, I will modify the r9y9/Tacotron-2 code, add your style embedding code to it, and then see how it works.
@syang1993 What about the loss function?
@syang1993 Thanks! Is it possible to integrate Tacotron 1 with the WaveNet vocoder? The GST Tacotron paper mentions that they tested it with WaveNet, so I think it...
@syang1993 OK, got it. The issue with Tacotron 1 might be due to the receptive field width. Anyway, regarding Tacotron 2, is just adding the style embedding part of your code enough (though...
@karamarieliu Could you share your work with me? I am also working on this issue.
@karamarieliu Does that mean GST-Tacotron 1 with wavenet_vocoder is running fine? Do you have any voice samples of that? I ask because I tried to integrate gst-tacotron (based on Tacotron 1) with wavenet vocoder...
@karamarieliu Can you share here how to train WaveNet (with GST Tacotron 1) and how to synthesise audio? I followed the and figured out how to synthesise, but it...
@zhaoguangxiang Use this repo for the pre-processing and training scripts for the CvT model: https://github.com/rosinality/vision-transformers-pytorch