Yingfan Tao
Results: 2 issues by Yingfan Tao
Hi, yy, I was wondering why you chose `xavier_normal_(self.weight, gain=2, mode='out')` instead of `nn.init.xavier_uniform_(self.weight)` when initializing the weights. Also, looking through the `xavier_normal_` function, I found the weights won't participate...
Hello! In the paper, t should be calculated as: [image] It seems to be `self.t = target_logit.mean() * 0.99 + (1-0.99) * self.t` in the code. But found "self.t =...
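For anyone comparing the two forms, the update quoted above is an exponential moving average, and which term carries the 0.99 changes how quickly t follows the batch statistic. Here is a minimal sketch in plain Python, assuming the batch mean of the target logits can be treated as a float. The names `ema_update` and `run` and the sample values are hypothetical and not taken from the repository.

```python
def ema_update(t_prev: float, batch_mean: float, weight_on_new: float) -> float:
    """One EMA step: t = weight_on_new * batch_mean + (1 - weight_on_new) * t_prev."""
    return weight_on_new * batch_mean + (1.0 - weight_on_new) * t_prev


def run(weight_on_new: float, batch_means: list, t0: float = 0.0) -> float:
    """Apply the EMA step once per batch, starting from t0."""
    t = t0
    for m in batch_means:
        t = ema_update(t, m, weight_on_new)
    return t


# Made-up batch means, only to illustrate the behaviour (not real training data).
batch_means = [0.5] * 5

# Weight 0.99 on the new value: t jumps to the batch mean almost immediately.
t_fast = run(0.99, batch_means)

# Weight 0.01 on the new value: t moves toward the batch mean slowly.
t_slow = run(0.01, batch_means)

print(t_fast, t_slow)
```

With 0.99 on the new value, t is close to the latest batch mean after a single step, so very little smoothing happens. With 0.99 on the previous value, t changes by only about 1% of the gap per step.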