Mysterybullet
Results
4
comments of
Mysterybullet
However, the code include model_kwargs in _vlb_terms_bpd while choose the KL loss, so i guess whether the model_kwargs is forgetten in the MSE loss (learn_sigma).
I have the same doubt, what's the reason for doing this ?
care about this too