hifi-gan
hifi-gan copied to clipboard
(conv_pre): Conv1d(80, 512, kernel_size=(7,), stride=(1,), padding=(3,))
(conv_pre): Conv1d(80, 512, kernel_size=(7,), stride=(1,), padding=(3,)) (0): ConvTranspose1d(512, 256, kernel_size=(16,), stride=(8,), padding=(4,)) (1): ConvTranspose1d(256, 128, kernel_size=(16,), stride=(8,), padding=(4,)) (2): ConvTranspose1d(128, 64, kernel_size=(4,), stride=(2,), padding=(1,)) (3): ConvTranspose1d(64, 32, kernel_size=(4,), stride=(2,), padding=(1,))
My understanding is that: Convolution is the process by which the out_channel becomes smaller, and deconvolution is the process by which the out_channel becomes larger. Why is it the opposite in the code?