pytorch
pytorch copied to clipboard
optimizer loss becomes nan
I'm trying to use your implementation to faster optimize a problem that I've already trated using different optimizers and libraries. During the first iteration of LFBGS_B, the losses are in the first steps calculated correctly (and they are correctly decreasing), but then suddenly they become nan, and the same happen to the parameters being optimized. What can cause this behavior?