KevinKune issues

Results: 8 issues of KevinKune

Glorot initialization

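I think using Xavier initialization directly for the network parameters is slightly problematic, because the network's true input and output sizes are not [2*embed_size, 1] but [n^2*embed_size, n^2/2]. From this perspective, to keep the distributions of the outputs and gradients close to standard normal, the initialization should use a 3-dimensional Xavier shape, [n^2/2, 2*embed_size, 1]. Of course, from the network's own point of view the input and output are still [2*embed_size, 1], so Glorot's method may not fully apply, and a more reasonable initialization probably lies somewhere between [2*embed_size, 1] and [n^2/2, 2*embed_size, 1]. The current initialization easily produces NaN, probably because the variance is too large. This can be crudely controlled by tuning the temperature, but if the gradients explode within the first few steps, tuning the temperature cannot rescue training either.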
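To make the size of the gap concrete, here is a minimal sketch comparing the Glorot-normal standard deviation for the two shapes discussed above. It assumes the common convention that a weight shape is laid out as (..., in, out) and that leading dimensions multiply both fans; the helper names and the values of `n` and `embed_size` are hypothetical and are not taken from the repository.

```python
import math

def glorot_fans(shape):
    """Return (fan_in, fan_out) for a weight shape laid out as (..., in, out).

    Assumption: leading dimensions act like a receptive field and
    multiply both fans, as in the usual convolution-kernel convention.
    """
    receptive = math.prod(shape[:-2])  # 1 for a plain 2-D shape
    return shape[-2] * receptive, shape[-1] * receptive

def glorot_normal_std(shape):
    """Standard deviation used by Glorot (Xavier) normal initialization."""
    fan_in, fan_out = glorot_fans(shape)
    return math.sqrt(2.0 / (fan_in + fan_out))

# Hypothetical sizes, chosen only for illustration.
n, embed_size = 10, 8

nominal_shape = (2 * embed_size, 1)                # what the layer itself sees
effective_shape = (n * n // 2, 2 * embed_size, 1)  # shape proposed in the issue

nominal_std = glorot_normal_std(nominal_shape)
effective_std = glorot_normal_std(effective_shape)
ratio = nominal_std / effective_std  # how much larger the nominal std is
```

Under these assumptions the 3-dimensional shape yields fans of n^2*embed_size and n^2/2, matching the sizes quoted in the issue, and the nominal standard deviation is larger by a factor of sqrt(n^2/2). A compromise between the two, as the issue suggests, would fall somewhere inside that range.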

KevinKune

Glorot initialization

dropout in validation/evaluation

NaN during AFM training

releasing other text classification models, datasets & unlabeled corpus

gradient accumulation

Can this model be used for molecule generation?

severe bug after updating version

unexpected behavior when calculating key interactions, and asking for help when using your code