MeanSum
MeanSum copied to clipboard
About the nll Loss
hi , when i train the No pre-trained language model ,why the nll loss is Nan sometimes ?