Heart-eartH
Results: 2 comments of Heart-eartH
Thank you for your answer, it completely solved my problem.
> > I got nan during training, I think it is because I loaded the model as float16?
> > I found that when training vitb, if I set qkv_bias...