ALBEF

About VQA annotations

Open. simplelifetime opened this issue 3 years ago • 1 comment

Hello, thanks for your excellent work. I'm reproducing the results in the repo and found that the vqa_train annotation files differ from the original VQAv2 annotations. Some answers in vqa_train appear in neither the VQAv2 nor the VQAv1 annotations. Is there any data augmentation, or am I missing something? An example: the question "what is written on the bus" has the answers ['buddy holly', 'buddy holly and crickets']. Neither answer exists in the answer pools or in the annotation files.

simplelifetime, Oct 26 '22 01:10

Hi, we use the official VQAv2 annotations. Note that QA pairs from Visual Genome are also used during fine-tuning, so answers that are missing from VQAv2 may come from there.

LiJunnan1992, Oct 27 '22 00:10
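
To check which entries in a mixed training file come from which source, a minimal sketch like the following may help. It assumes each annotation is a dict carrying a source tag under a `dataset` key with values such as `"vqa"` or `"vg"`; that key name, its values, and the sample entries are assumptions for illustration, not something confirmed in this thread, so adjust them to the actual file.

```python
from collections import Counter


def summarize_sources(annotations, source_key="dataset"):
    """Count annotations per source and keep one example entry per source.

    `source_key` is an assumed field name; entries without it are
    grouped under "unknown".
    """
    counts = Counter()
    examples = {}
    for ann in annotations:
        source = ann.get(source_key, "unknown")
        counts[source] += 1
        # Keep only the first entry seen for each source.
        examples.setdefault(source, ann)
    return counts, examples


# Hypothetical entries; in practice, load the real list with json.load().
sample = [
    {"question": "what color is the bus", "answer": ["red"], "dataset": "vqa"},
    {"question": "what is on the table", "answer": ["a cup"], "dataset": "vg"},
    {"question": "how many people are there", "answer": ["2"], "dataset": "vqa"},
]

counts, examples = summarize_sources(sample)
for source, n in sorted(counts.items()):
    print(source, n, examples[source]["question"])
```

If the real file has no such tag, the same idea still applies: collect the question IDs or question strings from the official VQAv2 files into a set and treat any training entry outside that set as a candidate Visual Genome pair.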