pytorch-image-models icon indicating copy to clipboard operation
pytorch-image-models copied to clipboard

[FEATURE] Weights of DINO fine-tuned on Imagenet?

Open NightMachinery opened this issue 2 years ago • 5 comments

Currently, only headless weights are available for DINO ViTs. It'd be good if the weights of a version fine-tuned for Imagenet was also added. I don't know if Facebook actually published these weights, but they do have experiments on Imagenet DINO in their paper.

NightMachinery avatar Jul 06 '23 13:07 NightMachinery

Meta released a linear projection head for the DINOv2. We will be able to load it if we modify the safetensor weights.

leng-yue avatar Jul 06 '23 17:07 leng-yue

if we modify the safetensor weights.

What do you mean?

NightMachinery avatar Jul 06 '23 18:07 NightMachinery

The official DINOv2 ships the weight of backbone and head separately. We need to concat them into one weights.

leng-yue avatar Jul 07 '23 20:07 leng-yue

It also uses a non-trivial head, needs features from more than one spot, not too crazy, but needs design for a custom head + backbone that I've thought of, but not implemented...

rwightman avatar Jul 07 '23 22:07 rwightman