Efficient-AI-Backbones
Efficient-AI-Backbones copied to clipboard
Do you use the self-distillation during training?
@yehuitang Hello, thank you for releasing your VITAUG code. In your paper, you said you trained following the DEIT. DEIT used self-distillation during training. But I do not find it in your code. Could you please tell me whether you adopt the self-distillation during your training?
And does you implement the Fourier transformation in your code? Thank you