efficientvit
Any plans for DINOv2?
DINOv2 has shown remarkable performance on downstream tasks, but its Vision Transformer (ViT) backbone is computationally expensive. Do you have plans to train an EfficientViT version of DINOv2?