mvits_for_class_agnostic_od icon indicating copy to clipboard operation
mvits_for_class_agnostic_od copied to clipboard

using my own custom dataset

Open nikky4D opened this issue 3 years ago • 2 comments

I would like to finetune on my own dataset. Do you have recommendations on how I can create my own dataset for this?

nikky4D avatar Feb 03 '22 16:02 nikky4D

I have a question on the pretraining. I want to pretrain only on my dataset. Can i modify pretrain.json to only specify path to my dataset? what else should I change to get pretraining?

nikky4D avatar Feb 23 '22 08:02 nikky4D

I have a question on the pretraining. I want to pretrain only on my dataset. Can i modify pretrain.json to only specify path to my dataset? what else should I change to get pretraining?

Hi @nikky4D,

Thank you for your interest in our work. We use the same setup as of MDETR for pretraining our model. Specifically, we trained on approximately 1.3 M image-caption pairs from GQA, COCO & Flicker.

In order to train on your custom dataset, you will need to convert your dataset in COCO format containing captions and tokens_positive defining alignment with the bounding boxes. The issue at explains the required format of tokens_positive. Further, the standard data loader used can be found at.

In addition to that, you can also evaluate MDef-DETR on your dataset without any pretraining/fine-tuning. Please refer to this issue for details.

I hope this information will be helpful.

mmaaz60 avatar Feb 24 '22 23:02 mmaaz60