regnet Performance for RegNetY

Performance for RegNetY

Open zhanghang1989 opened this issue 4 years ago • 4 comments

First, thanks for the awesome work of re-implementing RegNet.

I am having difficulty of reproducing the results for RegNetY-0.4GF. The configurations are taken from the original repo:

group_width = 8
initial_width = 48
slope = 27.89
quantized_param = 2.09
network_depth = 16

I only get 72.92 top-1 accuracy, but the original paper reported 74.2. Any thoughts on that?

May 25 '20 19:05 zhanghang1989

Hi, Thanks for your interest. Do you change anything else on our code (e.g data augmentation, schedule, ...)?

May 26 '20 09:05 uvipen

Thanks for the quick reply!

I used different training HPs learning rate: 0.2, batch size: 512, weight decay: 1e-4

When switching back to paper default: learning rate: 0.8, batch size: 1024, weight decay: 5e-5, I got 73.66 now.

May 26 '20 15:05 zhanghang1989

Great. Pls notice that data-preprocessing stage also influences performance. Since the paper didnt explicitly mention this step, we only applied very basic techniques. That's the reason why performance could be slightly different from paper

May 26 '20 15:05 uvipen

Thanks @uvipen Could you the results in your experiments? I saw the README is still pending.

May 26 '20 16:05 zhanghang1989

regnet regnet copied to clipboard

Performance for RegNetY

regnet
regnet copied to clipboard