RepDistiller
RepDistiller copied to clipboard
Why using log_softmax instead of softmax?
I think it should be softmax
instead. Otherwise p_t
and p_s
are not comparable.
Could you please explain why? https://github.com/HobbitLong/RepDistiller/blob/dcc043277f2820efafd679ffb82b8e8195b7e222/distiller_zoo/KD.py#L13-L17
for those who have similar question like i did: https://github.com/yoshitomo-matsubara/torchdistill/issues/233