ImageNet21K icon indicating copy to clipboard operation
ImageNet21K copied to clipboard

When using your ImageNet21K pretrained ResNet50 model in Detectron2, performance degrades

Open miznchimaki opened this issue 3 years ago • 1 comments

Thanks for your great work! I have a question when using your ResNet50 model as pretrained weights of Faster R-CNN in Detectron2: your 21K pretrained weights gives 8 point lower mAP than MSRA 1K pretrained one. Before I loaded your 21K pretrained weights into the Faster R-CNN in Detectron2, I noticed that your ResNet50 was trained by input whose value is between 0 and 1 (this is achieved by dividing 255 in pixel-wise manner in your code), but the input in Detectron2 was normalized by substractig pixel mean value and dividing std value in ImageNet, so I set the pixel mean value to 0 and std value to 255 in Detectron2. Although I have done above steps, performance of Faster R-CNN based on your 21K pretrained model still lays far behind MSRA's 1K pretrained one. So I want to know is there some problems I ignored? Sincerely waiting your response!

miznchimaki avatar Feb 19 '22 11:02 miznchimaki

Not sure if you've already solved the issue. Detectron2 ResNet is caffe style, which is slightly different architecture from the TorchVision version.

zhanghang1989 avatar Jun 02 '23 00:06 zhanghang1989