VideoMAE-Action-Detection
VideoMAE-Action-Detection copied to clipboard
Input resolution of train and validation
Hi authors,
I found that the input video resolution was set to 16x224x224 while the video resolution used during the validation was 16x256x352.
I might read it wrong, but I wonder how much the video resolution could affect the validation accuracy. Could you please provide results validated with 224x224?