ALBEF
About VQA annotations
Hello, thanks for your excellent work. I'm reproducing the results in the repo. I found that the vqa_train annotation file differs from the original VQAv2 annotations. There are some answers in vqa_train that I can't find in either the VQAv2 or the VQAv1 annotations. Is there any data augmentation, or am I missing something? An example: what is written on the bus ['buddy holly', 'buddy holly and crickets'] These two answers exist neither in the answer pool nor in the annotation files.
Hi, we use the official VQAv2 annotations. Note that QA pairs from Visual Genome are also used during fine-tuning.
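For anyone who wants to check this on their own copy, here is a minimal sketch that counts training entries whose answers never appear in the official annotations, grouped by source. It assumes each vqa_train entry has an "answer" field (a string or a list of strings) and a "dataset" field naming its source; those field names are assumptions and should be checked against your file. The "annotations", "answers" and "multiple_choice_answer" keys follow the official VQA annotation JSON layout. The data below is toy data for illustration only, not taken from the real files.

```python
from collections import Counter


def official_answer_set(vqa_annotations):
    """Collect every answer string from an official VQA-style annotation dict."""
    answers = set()
    for ann in vqa_annotations["annotations"]:
        answers.add(ann["multiple_choice_answer"])
        for a in ann["answers"]:
            answers.add(a["answer"])
    return answers


def unmatched_by_source(train_entries, known_answers):
    """Count entries with no answer in known_answers, keyed by source dataset.

    Assumes (hypothetically) that each entry carries "answer" and "dataset".
    """
    counts = Counter()
    for entry in train_entries:
        ans = entry["answer"]
        ans_list = ans if isinstance(ans, list) else [ans]
        if not any(a in known_answers for a in ans_list):
            counts[entry.get("dataset", "unknown")] += 1
    return counts


# Toy stand-in for the official annotation JSON.
toy_official = {
    "annotations": [
        {
            "question_id": 1,
            "multiple_choice_answer": "red",
            "answers": [{"answer": "red"}, {"answer": "dark red"}],
        }
    ]
}

# Toy stand-in for vqa_train entries.
toy_train = [
    {"question": "what color is the bus", "answer": ["red"], "dataset": "vqa"},
    {"question": "what is on the table", "answer": "a vase", "dataset": "vg"},
]

known = official_answer_set(toy_official)
print(unmatched_by_source(toy_train, known))
```

To run it on real data, load both files with json.load and pass them in place of the toy structures. If the unmatched entries all come from the Visual Genome source, that is consistent with the explanation above.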