dahuaxiya
Results
3
comments of
dahuaxiya
> yes, I met the same thing too. If test the model, the loss will be nan when ema is closed.However, the loss is normal without test step.
I get 96.7% test accuracy too after 200 epochs. Do you know why now?
> Owner 是这样的,我试了一下。每个GPU只能看到所有数据的一部分,是我的计算方法有误,应该除以每个GPU看到的所有样本数量,而不是数据集所有的样本数量。 谢谢作者的解答