kinetics_i3d_pytorch icon indicating copy to clipboard operation
kinetics_i3d_pytorch copied to clipboard

The padding of I3D model should be symmetrical

Open fjchange opened this issue 3 years ago • 1 comments

The Module of MaxPool3dTFPadding with kernel_size=(1,3,3), stride(1,2,2) can lead to asymmetrical padding. It would influence the output feature map, as the bottom right would be usually higher than other part of the feature map.

When I try to input a all zeros tensor into I3D model pretrained on Kinetics-400, someting strange happen, I average pooling the C and T dim and min-max norm to get a picture as below. The bottom right is much higher than other parts. grad_cam

By checking each layer output, I find out bottom right is usually have higher activation value than other part but not obvious, until mixed_5b block.

I don't know whether it hurt the model's performance, but at least it hurt the Interpretability.

fjchange avatar Sep 18 '20 10:09 fjchange

Thank you for raising this issue.

Interesting ! I am not sure whether this is influencing the numerical results as I converted the models by checking that the results were consistent on different images. I did observe a slight discrepancy with the original model (as reported in the readme) maybe this is related !

Best,

Yana

hassony2 avatar Sep 28 '20 09:09 hassony2