X-VLM icon indicating copy to clipboard operation
X-VLM copied to clipboard

Script to generate RegionTextJsonDataset?

Open daizuozhuo opened this issue 2 years ago • 4 comments

HI, I understand that the data cannot redistributed, but could you share the code to generate RegionTextJsonDataset from the official COCO, VG datasets so we can follow the pretraining method?

daizuozhuo avatar Jun 29 '22 02:06 daizuozhuo

Hi,

I found that some other methods just released their data. So, I will release the processed json files (image not included) in this week. Please follow up then.

zengyan-97 avatar Jun 29 '22 02:06 zengyan-97

Is the json file avaiable now?

xugy16 avatar Aug 05 '22 17:08 xugy16

yes, I am also interested! :)

mariyahendriksen avatar Sep 08 '22 11:09 mariyahendriksen

Hi,

The json files have been available for a while. Please see README for details. On the other hand, we only applied some preprocessing to filter invalid bboxes in the public data. You can download the data from the original websites, and do the filtering by yourself.

zengyan-97 avatar Sep 08 '22 12:09 zengyan-97