image-paragraph-captioning

pre-trained model

Open volquelme opened this issue 6 years ago • 2 comments

Thanks for your research and for sharing it! I am wondering which dataset was used for the pre-trained model: MS-COCO or Visual Genome? If MS-COCO was used, could I get a model pre-trained on the Visual Genome dataset?

volquelme avatar Aug 06 '19 04:08 volquelme

The provided pre-trained model was most likely trained on Visual Genome, since Visual Genome provides paragraph-length captions while MS-COCO does not.

arjung128 avatar Aug 07 '19 03:08 arjung128

Ah, I got it. Thanks for your reply!

volquelme avatar Aug 07 '19 06:08 volquelme