siamese-fc
Why use totalStride?
I don't understand this line of the code:
disp_instanceInput = disp_instanceFinal * p.totalStride / p.responseUp;
Why multiply by p.totalStride? Is it because of the stride of the conv layers?
Yes. Usually there will be several layers in the network with stride > 1, so one step in the response map corresponds to several pixels in the network input. This means that displacements in the response map need to be multiplied by the overall stride of the network.
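To make the arithmetic concrete, here is a minimal Python sketch of the same conversion. It is not the repository's code: the function name is hypothetical, the values 8 and 16 are only illustrative, and it assumes (as the name suggests) that p.responseUp is the factor by which the response map was upsampled before the peak was located.

```python
def response_to_input_displacement(disp_final, total_stride, response_up):
    """Convert a displacement measured in the upsampled response map
    into a displacement in network-input pixels.

    Hypothetical helper mirroring:
        disp_instanceInput = disp_instanceFinal * p.totalStride / p.responseUp
    """
    # Dividing by response_up undoes the (assumed) upsampling of the response map;
    # multiplying by total_stride maps response-map cells to input pixels.
    return tuple(d * total_stride / response_up for d in disp_final)


# Illustrative values only: overall stride 8, upsampling factor 16.
disp_input = response_to_input_displacement((32, -16), total_stride=8, response_up=16)
print(disp_input)  # (16.0, -8.0)
```

With these numbers, a peak 32 cells away in the upsampled map is 2 cells away in the raw response map, which is 16 pixels in the network input.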
@jvlmdr Hello, I don't quite understand the 8 coordinates in ground_truth. I tried using 4 coordinates instead, and I found that the box in the first frame did not completely enclose the target. If I use my own image sequence, what coordinate format should I use for the first frame?