LightXML
some problem in training
I encountered an anomaly during training. At a certain batch size, P@1 dropped from 0.9 to 0.1.
Maybe the gradients exploded during training. Which dataset did you use?
AmazonCat13k. I used apex with opt level O1. When I switched to O0, the gradient explosion did not occur.
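For anyone hitting the same thing, here is a rough sketch of why O1 and O0 can behave differently. O1 runs selected ops in FP16, whose largest finite value is 65504, while O0 keeps everything in FP32. The helpers `to_fp16` and `has_overflow` and the sample gradient values are made up for illustration; they are not part of LightXML or apex, and this only mimics the cast with the standard library rather than reproducing what apex does internally.

```python
import math
import struct

FP16_MAX = 65504.0  # largest finite value in IEEE 754 half precision


def to_fp16(x: float) -> float:
    """Round a Python float to half precision; overflow becomes +/-inf (hypothetical helper)."""
    try:
        # The struct "e" format is IEEE 754 binary16.
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        # struct refuses to pack values too large for FP16; treat them as inf.
        return math.copysign(math.inf, x)


def has_overflow(grads, loss_scale=1.0):
    """Return True if any scaled gradient is non-finite after the FP16 cast (hypothetical helper)."""
    return any(not math.isfinite(to_fp16(g * loss_scale)) for g in grads)


# Illustrative values only, not measurements from the run discussed above.
grads = [0.5, -3.0, 1000.0]
print(has_overflow(grads))                    # unscaled gradients fit in FP16
print(has_overflow(grads, loss_scale=128.0))  # 1000 * 128 exceeds FP16_MAX
```

If the FP16 range is the cause here, it may help to log the gradient norm around the step where P@1 collapses, and to try gradient clipping or a smaller loss scale before falling back to O0.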