Hari-Durai-Baskar

Results 3 issues of Hari-Durai-Baskar

Loading the checkpoint for the vision encoder shows a list containing the boolean False. Why does this happen? ![image](https://github.com/OpenGVLab/InternVideo/assets/97948393/4c65cad8-bf9e-45e1-ae63-5dd2d88b8f48)

What is the exact GPU memory required to run the evaluation experiment of internvideostage2?

Hi, do you have a Docker image for s2 inference? For some reason I need to build a Docker container for inference or use an available Docker image for the...