BatchED icon indicating copy to clipboard operation
BatchED copied to clipboard

Getting NaNs

Open assafshocher opened this issue 1 year ago • 9 comments

Hi, When running inside network training, after a while with some probability i'm getting nan for one instance in the batch. All the eigen values and the eigen vecotrs are nan. The matrix batch in this case is 1024X64X64 and it is real and symmetric. I'm using ed_plus. It seems to have a relation to the batch size, as the same matrix that caused the nan, doesn't nan when taken alone or with a smaller batch. Ant idea of where I should investigate this? Thanks!

assafshocher avatar Mar 30 '23 23:03 assafshocher