InternVideo
Can stage-3 training further improve the performance of InternVideo2 on basic video tasks?
Thanks for the great work! In stage 3, the video encoder is updated to improve its support for video-centric dialogue. Does stage-3 training affect performance on basic video tasks? A comparison like the one in Table 4 would be appreciated.