Video-LLaMA
Video-LLaMA copied to clipboard
how to increase the numbers of input frame?
hi, authors, I want to use Video-LLaMA to infer my own dataset, I find that the current framework supports the max number of input frames as 32, if I change the frames in the config that more than 32, there is an error shown, so how to increase the frames that more than 32?
thanks!!!
I am also needing this THX!
it cannot seem to be more than 32 frames. since the input dimension of the checkpoint that the authors provide is 32*768.