InternVideo
InternVideo copied to clipboard
why InternVid-200M performance is not as good as ViClip-InternVid-10M-FLT.pth
hi, since 200M pretrained dataset is much bigger than 10M version, so why the zero shot performance is not superior than 10M?
i guess it's because 10m is the subset that somehow max the data diversity