MUNIT icon indicating copy to clipboard operation
MUNIT copied to clipboard

Why 4x4 Conv in this network, and where this idea come from

Open imlixinyang opened this issue 7 years ago • 4 comments

In discriminator, style encoder and content encoder, i find 4x4 conv filters. Where this idea came from or i missed something.

imlixinyang avatar Oct 29 '18 03:10 imlixinyang

4x4, stride=2, padding=1, in a word, just for downsample

ShihuaHuang95 avatar Dec 01 '18 01:12 ShihuaHuang95

From my point of view, using even-sized conv kernels is just to show that Deep networks can work with conv kernel of sizes odd or even, or none-square shapes.

doantientai avatar Jan 04 '19 15:01 doantientai

In my opinion, 4x4 kernel and 2x2 stride conv might be able to alleviate the checkerboard issues.

chychen avatar Jun 24 '19 05:06 chychen

I think it is just the originality of the DCGAN paper (Deconvolution and convolution with kernel size 4).

And there seems to be no reason in DCGAN's convolution layer.

See the following author's article. https://discuss.pytorch.org/t/in-dcgan-why-the-kernel-size-of-4-is-used/20616/2

In deconvolution, however, it is convenient to upscale the size of a feature map exactly twice.

I think that the intentions of the authors of DCGAN, which attempted to make the generator and discriminator equal, seem to have become a de facto standard.

caffeinism avatar Jul 04 '19 04:07 caffeinism