PIPNet icon indicating copy to clipboard operation
PIPNet copied to clipboard

Is the stride or feature map size influence the NME?

Open CPFelix opened this issue 3 years ago • 2 comments

I trained on my dataset and use the input size 64X80, and found the NME is larger than the regression method before used, and when I modified the input size to 256X256, NME is lower than before. @jhb86253817

CPFelix avatar Feb 13 '23 06:02 CPFelix

And I'm also curious about the loss weight.Why cls_loss_weight is bigger than reg_loss_weight, according to my analysis, the cls is easier than reg, and reg loss should be important to the final NME. Thanks for pointing out my mistake!

CPFelix avatar Feb 13 '23 08:02 CPFelix

Hi @CPFelix ,

  1. For PIPNet, if the input size is changed to 64x80, it may cause some problems. (a) E.g., 80 is not divisible by the power of 2 (the network stride), which may introduce error for localization. (b) Since ResNet has Stride 32, your heatmap has size 2x2, which is a bit meaningless for doing heatmap regression. You may need to reduce the stride of the backbone so that the size for heatmap regression and coordinate regression is balanced.
  2. I used larger weights for cls_loss so that its loss scale is comparable to reg_loss. Although cls is easier than reg, it is also important. Anyway, you may also try adjusting the weights see if improves.

jhb86253817 avatar Apr 17 '23 06:04 jhb86253817