EgoVLP
EgoVLP copied to clipboard
On the setting of `num_frames`
Thanks for your great work!
I am curious about the setting of num_frames
of the pretrained model EgoVLP_PT_BEST.
I noticed that, in one of the closed issues, you clarified that num_frames=4
is used to train EgoVLP_PT_BEST. However, in configs/pt/egoclip.json
, num_frames=16
is used.
Also, when loading EgoVLP_PT_BEST using a config file with num_frames=4
, I get an error of size mismatch (shown in the attached image). It seems that the num_frames
is 16 in EgoVLP_PT_BEST.
Could you please further explain how the num_frames
is defined across the different configs and checkpoints?
Thanks for your help in advance!
seems #4 is related to this