LIR-for-Unsupervised-IR

About KL Loss

Open qibao77 opened this issue 3 years ago • 4 comments

The work is interesting! However, in your paper you only add a KL divergence loss to regularize the distribution of the noise code, while in your open-source code you add the KL loss to all latent features. Why is there such a difference? Is the KL loss important to the final result?
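To make the contrast concrete, here is a rough sketch of the two setups being asked about (this is not the repository's code; `kl_term`, `noise_code`, and `content_code` are hypothetical names):

```python
import torch

def kl_term(mu):
    # Placeholder regularizer on a latent code (the exact form is discussed further below).
    return mu.pow(2).mean()

# Hypothetical latent codes produced by the encoders.
noise_code = torch.randn(8, 32)
content_code = torch.randn(8, 128)

# Paper: only the noise code's distribution is regularized.
loss_kl_paper = kl_term(noise_code)

# Released code, as described above: every latent feature gets the regularizer.
loss_kl_code = kl_term(noise_code) + kl_term(content_code)
```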

qibao77 avatar Aug 18 '20 13:08 qibao77

Adding the KL loss to the latent codes was only used to validate the effects of different loss functions in my experiments, e.g., jointly using the GAN loss and the KL loss, but my experiments show it has little effect on the metrics. You could also remove it from the source code.

Wenchao-Du avatar Aug 19 '20 05:08 Wenchao-Du

Thank you for your reply!

qibao77 avatar Aug 20 '20 03:08 qibao77

Another question: I found that the KL loss in your code is actually an L2 regularizer (using only the mean), whereas the KL loss of a VAE should involve both the mean and the variance. Why is there such a difference?
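For reference, a minimal sketch of the two regularizers being compared (not taken from the repository; tensor shapes and names are illustrative assumptions):

```python
import torch

def l2_mean_penalty(mu):
    # What the released code is described as doing above:
    # penalize the squared mean of the latent code, ignoring its variance.
    return mu.pow(2).mean()

def vae_kl(mu, logvar):
    # Standard VAE regularizer: KL( N(mu, diag(exp(logvar))) || N(0, I) )
    # = -0.5 * sum(1 + logvar - mu^2 - exp(logvar)) over the latent dimensions.
    return (-0.5 * (1 + logvar - mu.pow(2) - logvar.exp()).sum(dim=1)).mean()

# Illustrative tensors: a batch of 8 latent codes with 64 dimensions each.
mu = torch.randn(8, 64)
logvar = torch.zeros(8, 64)  # with unit variance the two terms differ only by a constant factor
print(l2_mean_penalty(mu).item(), vae_kl(mu, logvar).item())
```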

qibao77 avatar Sep 02 '20 03:09 qibao77

I have the same question about the KL loss. It seems the author only uses an L2 penalty to push the mean toward zero, which differs from the regular KL divergence. Can you explain the difference?

yuguochencuc avatar Mar 26 '21 02:03 yuguochencuc