sagan-pytorch icon indicating copy to clipboard operation
sagan-pytorch copied to clipboard

Why so many ConvBlock(512, 512 ???

Open c1a1o1 opened this issue 6 years ago • 2 comments

    self.conv = nn.ModuleList([ConvBlock(512, 512, n_class=n_class),
                               ConvBlock(512, 512, n_class=n_class),
                               ConvBlock(512, 512, n_class=n_class,
                                         self_attention=True),
                               ConvBlock(512, 256, n_class=n_class),
                               ConvBlock(256, 128, n_class=n_class)])

c1a1o1 avatar Jul 05 '18 07:07 c1a1o1

Deeper & wider network gave better results. As conv block has only 1 conv module, network is not very deep.

rosinality avatar Jul 05 '18 07:07 rosinality

It's even better to use more ConvBlocks, but fewer filters. Deeper nets tend to learn better features. I would say the current model architecture is too wide, which leads to many "dead" kernels

oleksandrlazariev avatar Nov 21 '18 09:11 oleksandrlazariev