Heart-eartH

Results 2 comments of Heart-eartH

谢谢您的回答,这完全解决了我的问题

> > I got nan during training, I think it is because I loaded the model as float16? > > I found that when training vitb, if I set qkv_bias...