EgoVLP icon indicating copy to clipboard operation
EgoVLP copied to clipboard

On the setting of `num_frames`

Open Lyman-Smoker opened this issue 1 year ago • 1 comments

Thanks for your great work!

I am curious about the setting of num_frames of the pretrained model EgoVLP_PT_BEST.

I noticed that, in one of the closed issues, you clarified that num_frames=4 is used to train EgoVLP_PT_BEST. However, in configs/pt/egoclip.json, num_frames=16 is used.

Also, when loading EgoVLP_PT_BEST using a config file with num_frames=4, I get an error of size mismatch (shown in the attached image). It seems that the num_frames is 16 in EgoVLP_PT_BEST.

Could you please further explain how the num_frames is defined across the different configs and checkpoints?

Thanks for your help in advance!

image

Lyman-Smoker avatar Dec 13 '23 05:12 Lyman-Smoker

seems #4 is related to this

Haawron avatar Jan 02 '24 19:01 Haawron