Yufan Andrew Liu

Results 3 comments of Yufan Andrew Liu

Same problem, did you solve this? @BingzeWu

What do you mean by mini-batch? I've trained this with a batch size of 64, but the model only considers single-batch training, and NaN values still appear after several steps....

Hi Zhiyi, not yet, but you may find the NaN in the input feature part, and mask then with average or some constant to start the training, unfortunately, I did...