Transformer-SSL Question about the detection/segmentation results


Open amobiny opened this issue 3 years ago • 0 comments

Hi there, congrats on the nice work, and thanks for providing the code. I have a question about the experiments you conducted on the downstream tasks (detection and segmentation). For the detection/segmentation results reported in Table 3, did you perform SSL on ImageNet-1K, then use the resulting models as backbones and simply train on COCO? There was no SSL on the COCO data, right?

If so, could that be a reason why the MoBY model does not outperform the supervised model? What I'm trying to understand is this: can we expect a model that is SSL-pretrained on a large unannotated dataset, and then trained on the downstream task using a labeled portion of that same data, to perform significantly better than a model trained solely in a supervised fashion on the annotated portion? Any insight is appreciated.

Best,

Sep 15 '21 01:09 amobiny
