Zhixiao Ni

Results 2 comments of Zhixiao Ni

Laughing out loud, that's exactly right. The main challenge lies in the scarcity of documentation, forcing me to speculate and explore through the code myself. In reality, even models like...

I encountered the same issue where, at the beginning of training, the loss gradually decreased but then reached a point where the training seemed to diverge, and the loss started...