training-detr icon indicating copy to clipboard operation
training-detr copied to clipboard

About Train Detr by custom dataset and custom backbone

Open wangcn99 opened this issue 3 years ago • 2 comments

I am aware that you are trying to replace the backbone network of the DETR. Now,I also do the same work,but i havent get good result. Do you have any good suggestions? I think I didnt use good learning rate or other hyper paramters I use CBAM replace resnet thanks :)

wangcn99 avatar Mar 20 '21 06:03 wangcn99

Hi @wangcn99 - thanks for the update re: CBAM. I suspect there is more linkage between the DETR head and the backbone than initially thought. Thus, swapping in new backbones tends to be much more disruptive to DETR. I tried mobilenet and ResNeST, and neither did very well, so currently I still use the default Resnet-50.

One thing you could try is using AdaBelief optimizer instead of AdamW. I'm using it with GAN's right now and it's performing nicely.
That should give you better results than training with Adam(W).
adabelief_transformer

You can use their hyperparameters - adabelief is here: https://github.com/juntang-zhuang/Adabelief-Optimizer

lessw2020 avatar Mar 20 '21 22:03 lessw2020

@lessw2020 Have you tried any new experiments with switching out detr's backbone? Did you get it to work or it is still problematic with other backbones?

Also, instead of changing the backbone, do you have any experience with changing the weights for the resnet50 which i suspect should be useful when dealing with non-imagenet domains.

nikky4D avatar Oct 15 '21 21:10 nikky4D