zshy1205

Results 2 comments of zshy1205

@gaoliang13 loss2 was cal with the teacher model, why you said do not need the teacher model?

@K-Won I think if your base model is more complicated, then you can not get promotion. So I think you can try use a small model, and train it with...