MaskDINO About the training on our datasets

Thansk, we have learned a lot from your excellent work, we train the MaskDINO on a self-built datasets containing 2000 images, after 40000 iters, the metrics are nearly AP,AP50,AP75,APs,APm,APl 74.8523,92.0707,81.0479,44.8492,79.8165,77.5198

the metrics are pretty good! however, after the visulize of prediction results, there are much boundboxes around a same object, and there are much small areas on the big object,

Since DETR serial methods generally do not need to use NMS, I may want to know do you meet this question in the testing phase, how to solve this problem?

thanks again

Mar 30 '23 05:03 hhuxuchang

and this, two much small regions are segmented and apper on the big objectes.

Mar 30 '23 06:03 hhuxuchang

The AP is pretty high, therefore the predictions are supposed to have very few duplicate predictions. As we are using the sigmoid focal loss, so for visualization, you need to set a threshold for the predictions, for example, only select the prediction with a classification score larger than 0.3. You can also change this threshold to find the optimal one.

Mar 30 '23 07:03 FengLi-ust

thanks for your response, we have tried to change the OVERLAP_THRESHOLD, OBJECT_MASK_THRESHOLD (in config file) and confidence-threshold (in demo.py) But it does not work.

where to change the classification score and any other threshold ?

thanks again

Mar 30 '23 08:03 hhuxuchang

Meanwhile, as the output in dashboard 'detected 100 instances in 3.34s' since the query number is set as 100 there are only about 10 instances in the images, should we minimize the query numbers to obtain better results?

Mar 30 '23 08:03 hhuxuchang

You need to write a visualization script, in which you manually select the queries with classification scores larger than 0.3, and then visualize them.

100 queries is good in your case.

Mar 30 '23 08:03 FengLi-ust

thanks very much learnd a lot from u

Mar 30 '23 08:03 hhuxuchang