NExT-QA
NExT-QA copied to clipboard
Unknown definition of "qid" and "type" in annotations
Hi,
Thanks for the contribution of the dataset. I'm wondering if you can provide more details about each entry of the annotations, especially for "qid" and "type". Specifically, I assume "qid" identifies only one sample but found it ranging from 0 to 18. I guess "type" indicates the type of a question, for example "CW" for "Causal-Why", but there is no data of type "DB" for "Descriptive-Binary" and I don't know what "TC" means.
If I miss any web page or dataset description, please just direct me there. Thank you.
Hi, thanks for the interest. `qid' gives the question id within a video. So video_qid identifies a sample of the whole dataset. There is no binary question in the multi-choice annotations. You can find it in the NExT-OE repo. 'TC' means temporal questions of co-occurrent actions/activities, such as 'what is the kid doing when/while/as
the dog is barking? Eating.'
Thanks for your prompt reply!
Hi, thanks for the interest.
qid' gives the question id within a video. So video_qid identifies a sample of the whole dataset. There is no binary question in the multi-choice annotations. You can find it in the NExT-OE repo. 'TC' means temporal questions of co-occurrent actions/activities, such as 'what is the kid doing
when/while/as` the dog is barking? Eating.'
Hi, there are 8 types of questions, ['DC', 'CW', 'CH', 'TN', 'TC', 'DL', 'DO', 'TP'], what do they mean respectively? Are these 3 ACCs the accent types starting with D/C/T respectively?
Yes.
Yes.
Thank you. And there are 8 types of questions, ['DC', 'CW', 'CH', 'TN', 'TC', 'DL', 'DO', 'TP'], what do they mean respectively?
Hi, please refer to the evaluation file eval_mc.py..