MCQ
MCQ copied to clipboard
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
Hi, is there any script or config that use CLIP as the initialization?
Hello, wonderful project!. Here I wonder how to finetune the pre-trained models on downstream video-text retrieval datasets like MSR-VTT, LSMDC, and MSVD? I notice that the script for zero-shot retrieval...
I want to know if there is a regression head for MVM during the MILES pretraining phase.
请问您能否共享一下去除三元组后的数据集
Given a video I want to do captioning, or as you sugest answer questioning? Is it something possible?
Hi, I'm wondering why you add three [MASK] in answers. I have seen your reply in #7, but I still don't know why the number of [MASK] and whether it...
As mentioned in table 4, there are 3 different test split. How are the specific test sets selected and how many are there? Also for the table 5, what is...
我的意思是CLIP-initialized model 的MCQ模型代码,特别是BridgeFormer与VideoFormer和TextFormer的交互部分。