Vision-Language-Transformer

Confusion about data_process_v2

Open huangjy-pku opened this issue 3 years ago • 0 comments

Hello, I just checked the file 'data/data_process_v2.py', and I found something confusing.

Line 98 contains the check if dataset == 'refclef', so the script apparently handles the RefCLEF dataset as well as RefCOCO, RefCOCO+, and RefCOCOg. But should the RefCLEF categories be processed the same way as the RefCOCO* ones, that is, through the cat_process function? My guess is that cat_process converts the 91 COCO category IDs into the 80-category indexing. Does that conversion also apply to RefCLEF?
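To make the question concrete, here is a minimal sketch of the kind of conversion I assume cat_process performs. It is not copied from the repository: the names UNUSED_COCO_IDS, COCO_ID_TO_INDEX, and coco_id_to_contiguous are hypothetical, and the only fact it relies on is that 11 of the original 91 COCO category IDs are unused in the released 80-category annotations.

```python
# IDs from the original 91-category COCO list that are not used in the
# released 80-category annotations.
UNUSED_COCO_IDS = {12, 26, 29, 30, 45, 66, 68, 69, 71, 83, 91}

# Map every used COCO category ID (between 1 and 90) to a contiguous index 0..79.
COCO_ID_TO_INDEX = {
    coco_id: index
    for index, coco_id in enumerate(
        i for i in range(1, 92) if i not in UNUSED_COCO_IDS
    )
}


def coco_id_to_contiguous(coco_id: int) -> int:
    """Hypothetical stand-in for cat_process (an assumption, not the repo code).

    Raises KeyError for any ID that is not one of the 80 used COCO categories.
    """
    return COCO_ID_TO_INDEX[coco_id]
```

A mapping like this is only meaningful if the incoming IDs follow the COCO numbering, which is exactly what I am unsure about for RefCLEF.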

By the way, also on line 98, why are ['19579.jpg', '17975.jpg', '19575.jpg'] excluded? Is there an explanation for this?

Your reply would be highly appreciated, thanks :)

huangjy-pku, Feb 14 '22 12:02