InternVideo
About feature extraction from raw video using InternVideo2
Thank you for the great work!
I am currently working on temporal action localization and plan to use InternVideo2-1B and InternVideo2-6B to extract features from raw video data that is not available on Hugging Face. However, the exact feature extraction process is unclear to me.
Could you please provide guidance or an example on how to extract features from raw video using InternVideo2?