dinov2 icon indicating copy to clipboard operation
dinov2 copied to clipboard

Is there any pretrained dinov2 model with patch size of 16?

Open nagitam opened this issue 1 year ago • 3 comments

nagitam avatar Sep 01 '23 07:09 nagitam

Not at the moment; does it work for you if you resize your image before the patch embedding block, by a ratio of 14/16 ? I expect it should give the result you want

qasfb avatar Sep 01 '23 12:09 qasfb

Thanks. That was what I was thinking. The use-case is for semantic segmentation and I would have preferred to avoid rescaling.

nagitam avatar Sep 02 '23 12:09 nagitam

Got the same issue. Since as described in the paper, when linear probing the network for semantic segmentation, the patch size is 16 and the image size is 512. Even though I obtained similar results using image size 518 and patch size 14, using the code I implemented from draft.

ZichengDuan avatar Sep 26 '23 03:09 ZichengDuan