InternVideo
InternVideo copied to clipboard
Reproducing "C.1.1 How InternVideo2 Works in Feature-based Tasks"
Could you please provide a snippet showing how to select the right layers (namely last 5-th layer and 7-th layer) as input to actionformer?