extract multi-modal features using InternVideo2

Open xeroqin opened this issue 9 months ago • 0 comments

Hi InternVideo2 team！

Could you please share a code about how you extract the multi-modal features? I'd like to use the models to extract feature of my own dataset.

Thanks for your guidance!

Mar 31 '25 11:03 xeroqin