ISDA-for-Deep-Networks icon indicating copy to clipboard operation
ISDA-for-Deep-Networks copied to clipboard

Question about loss computation

Open Phoebe-ovo opened this issue 3 years ago • 3 comments

In you paper, the logit in the molecular is not changed, only the logits in the denominator are changed. However, in the code, I find all the logits are changed. https://github.com/blackfeather-wang/ISDA-for-Deep-Networks/blob/b66a594482557dada126211d65a4e9b6f4328423/Image%20classification%20on%20ImageNet/ISDA_imagenet.py#L99

Phoebe-ovo avatar May 11 '21 09:05 Phoebe-ovo

Thank you for your attention. In fact, we have v = 0 for the numerator, apparently. ;) We write this in the code for implementation convenience.

blackfeather-wang avatar May 12 '21 01:05 blackfeather-wang

Can you elaborate on v=0 for the numerator? Since the final loss goss through cross-entropy function, it seems the numerator also needs to take the same value as its corresponding component in the denominator, which gives a lower bound.

jackliu333 avatar Feb 26 '22 12:02 jackliu333

Now I see it. The additional addition in the logic of the numerator is gone because it becomes zero upon evaluation due to equal weights for the same class.

jackliu333 avatar Feb 26 '22 22:02 jackliu333