perpett

Results 1 comments of perpett

> Hello. In Section 5.2 “Model Distillation,” the document clearly states that the smaller models (ViT-Small, ViT-Base, ViT-Large) do not use Gram loss (i.e., the Gram-anchoring technique) during distillation. Specifically,...