Teacher-free-Knowledge-Distillation
Teacher-free-Knowledge-Distillation copied to clipboard
KD loss is zero
My loss after distillation is 0, which feels very strange. I want to ask whether there is a problem with the distillation method or the calculation of distillation function in the code. A little confused, I hope the writer or someone who knows can tell me.Thanks.