A2Net icon indicating copy to clipboard operation
A2Net copied to clipboard

I3D Feature Extraction

Open mokiki1 opened this issue 2 years ago • 2 comments

Hello, thank you very much for sharing. I have some questions about I3D feature extraction. I extract features through the links provided by you. I feel that the dimension of output features is not well understood. Assuming that the scale of the input I3D is [4,3,256,224,224] and the dimension of the final output is [4,1024,31,1,1], in the code you provide, the feature extracted by I3D should get a value in each frame. if its need to upsample?

mokiki1 avatar Apr 26 '22 11:04 mokiki1

You can directly use the feature, without upsample. The feature extraction process by I3D model would decrease the temporal length by a factor of $\frac{1}{8}$. Given 256 frames, you would obtain 32 features. In the paper, we use frame to indicate a feature vector, which correspond to 8 frames.

VividLe avatar Apr 26 '22 14:04 VividLe

Thank you for your answer. image

First, how to understand that stride = 4. Secondly, if can, I would like to know the length of the video input into the I3D model when you extract the features of a video.Finally, do you set overlap? thanks.

mokiki1 avatar May 13 '22 08:05 mokiki1