Shao Hsuan Huang

3 comments by Shao Hsuan Huang

Is there a solution to this? I have the same problem here.

Thanks for your suggestion. So if I decrease the learning rate to below 0.01, the loss should improve (no longer NaN)?
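
For anyone who wants to see the effect in isolation, here is a minimal sketch of why a learning rate that is too large can end in NaN. It is not the model from this thread: the toy loss `0.5 * curvature * w**2`, the function name `train`, and the value `curvature=150.0` are all assumptions chosen only so that the stability limit (`2 / curvature`, roughly 0.013) lands near 0.01. The real threshold depends on the model and data.

```python
import math

def train(lr, steps=2000, curvature=150.0, w0=1.0):
    """Plain gradient descent on the toy loss 0.5 * curvature * w**2.

    Hypothetical example: `curvature` is picked so that updates are
    stable only when lr < 2 / curvature (about 0.013).
    """
    w = w0
    loss = 0.5 * curvature * w * w
    for _ in range(steps):
        grad = curvature * w          # derivative of the toy loss
        w = w - lr * grad             # gradient descent update
        loss = 0.5 * curvature * w * w
        if math.isnan(loss):          # stop once the loss has blown up
            break
    return loss

print("lr=0.02  ->", train(0.02))     # above the limit: weights overflow
print("lr=0.005 ->", train(0.005))    # below the limit: loss shrinks
```

With the larger rate, each step overshoots the minimum and the weight grows until it overflows to infinity, after which the update computes infinity minus infinity and produces NaN. With the smaller rate the weight shrinks toward zero. If lowering the rate alone does not help in a real training run, gradient clipping is another common thing to try.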