[dataset] Related instruction IDs for LA In-context are incorrect
While looking through the annotations for LA in-context, I noticed that the instruction IDs listed under rel_ins_ids (the related instructions) do not exist as keys in the annotation file. Dense Caption doesn't seem to have this problem.
In [2]: import json
In [3]: with open('/path/to/LA_instructions.json') as f:
   ...:     annotations = json.load(f)
   ...:
In [4]: list(annotations['data'].keys())[:10]
Out[4]:
['LACONV_00_INS_000000033471_2',
'LACONV_00_INS_000000052846_4',
'LACONV_00_INS_000000334872_3',
'LACONV_00_INS_000000319154_4',
'LACONV_00_INS_000000398214_4',
'LACONV_00_INS_000000520873_4',
'LACONV_00_INS_000000575173_3',
'LACONV_00_INS_000000087286_3',
'LACONV_00_INS_000000032286_4',
'LACONV_00_INS_000000175217_4']
In [5]: annotations['data']['LACONV_00_INS_000000033471_2']
Out[5]:
{'instruction': 'Is the bus driving down the street or pulled off to the side?',
'answer': 'The bus is driving down the street, which is crowded with people and other vehicles.',
'image_ids': ['LA_00_IMG_000000033471'],
'rel_ins_ids': ['LACONV_00_INS_000000033471_0',
'LACONV_00_INS_000000033471_1']}
In [6]: annotations['data']['LACONV_00_INS_000000033471_0']
---------------------------------------------------------------------------
KeyError Traceback (most recent call last)
Cell In[6], line 1
----> 1 annotations['data']['LACONV_00_INS_000000033471_0']
KeyError: 'LACONV_00_INS_000000033471_0'
In [7]: annotations['data']['LACONV_00_INS_000000033471_1']
---------------------------------------------------------------------------
KeyError Traceback (most recent call last)
Cell In[7], line 1
----> 1 annotations['data']['LACONV_00_INS_000000033471_1']
KeyError: 'LACONV_00_INS_000000033471_1'
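To see how widespread this is beyond the one entry above, a check along the following lines could be run over the whole file. This is only a rough sketch: `find_missing_related` is a hypothetical helper name (not part of Otter), and it assumes every entry under `annotations['data']` has the same layout as the one shown in Out[5]. The `sample` dict is just that single entry copied in so the snippet runs on its own.

```python
def find_missing_related(data):
    """Map each instruction ID to the rel_ins_ids entries that are not keys in data."""
    missing = {}
    for ins_id, entry in data.items():
        # Treat a missing 'rel_ins_ids' field as "no related instructions".
        absent = [rid for rid in entry.get('rel_ins_ids', []) if rid not in data]
        if absent:
            missing[ins_id] = absent
    return missing


# Stand-in for annotations['data'], containing only the entry from Out[5].
sample = {
    'LACONV_00_INS_000000033471_2': {
        'instruction': 'Is the bus driving down the street or pulled off to the side?',
        'answer': 'The bus is driving down the street, which is crowded with people and other vehicles.',
        'image_ids': ['LA_00_IMG_000000033471'],
        'rel_ins_ids': ['LACONV_00_INS_000000033471_0',
                        'LACONV_00_INS_000000033471_1'],
    }
}

missing = find_missing_related(sample)
for ins_id, absent in missing.items():
    print(ins_id, '->', absent)
```

On the real file, the call would be `find_missing_related(annotations['data'])`, and comparing `len(missing)` with `len(annotations['data'])` would show whether every entry is affected or only some.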
Maybe I've misunderstood what related instructions are? Either way, please let me know!