xiyouji6

Results 2 issues of xiyouji6

In the training process, there are abnormalities with the grad_norm, and sometimes the grad_norm value becomes inf. Part of the training log is detailed below, but this situation generally only...