wyt1234
wyt1234
看了两天了,感觉还是一头雾水,虽然能感觉到作者的思想,但是这个结构还是摸不着头脑
There's one thing I don't understand。Why do have to change the one-quarter sampling rate? How does it help? Thanks.
Thank you for sharing,There's one thing I don't understand。 Why do have to change the one-quarter sampling rate of subsampling after removing it? What is the harm to VC task...
Thank you for the questions. For Q1: I adapted espnet a lot; it seems that espnet asr models always downsample the encoder input along the temporal axis more than 4x...