hifi-gan (conv_pre): Conv1d(80, 512, kernel_size=(7,), stride=(1,), padding=(3,))

(conv_pre): Conv1d(80, 512, kernel_size=(7,), stride=(1,), padding=(3,))

Open a897456 opened this issue 1 year ago • 1 comments

(conv_pre): Conv1d(80, 512, kernel_size=(7,), stride=(1,), padding=(3,)) (0): ConvTranspose1d(512, 256, kernel_size=(16,), stride=(8,), padding=(4,)) (1): ConvTranspose1d(256, 128, kernel_size=(16,), stride=(8,), padding=(4,)) (2): ConvTranspose1d(128, 64, kernel_size=(4,), stride=(2,), padding=(1,)) (3): ConvTranspose1d(64, 32, kernel_size=(4,), stride=(2,), padding=(1,))

My understanding is that: Convolution is the process by which the out_channel becomes smaller, and deconvolution is the process by which the out_channel becomes larger. Why is it the opposite in the code?

Oct 11 '23 10:10 a897456

hifi-gan hifi-gan copied to clipboard

(conv_pre): Conv1d(80, 512, kernel_size=(7,), stride=(1,), padding=(3,))

hifi-gan
hifi-gan copied to clipboard