nfnets_pytorch icon indicating copy to clipboard operation
nfnets_pytorch copied to clipboard

About WSconv2d

Open fancyshun opened this issue 4 years ago • 3 comments

I see the authoe's code, I find his WSconv2d pad_mod is 'same'. Pytorch's conv2d dono't have pad_mode, and I think your padding should greater 0, but I find your padding always be 0. I want to know why?

I see you train.py your learning rate is constant, why? Thank you!

fancyshun avatar Sep 09 '21 00:09 fancyshun

You are right, on first sight, the Stem layers need an additional padding='1' (all convolutions with kernel size 3). I will take a closer look soon.

It is constant, because I did not implement something fancier yet. What do you think it should be?

Thank you!

benjs avatar Sep 09 '21 06:09 benjs

You are right, on first sight, the Stem layers need an additional padding='1' (all convolutions with kernel size 3). I will take a closer look soon.

It is constant, because I did not implement something fancier yet. What do you think it should be?

Thank you!

when kernel size is 3, padding should be different, because stride is 1 or 2. I see the author's code learning rate is changed by warm up cosine decay. I also want to know if I train with multiple gpus, how to change batch size.

fancyshun avatar Sep 09 '21 07:09 fancyshun

This repo does not support multi gpu training, you can only reuse the model code for that.

benjs avatar Sep 09 '21 08:09 benjs