layer-norm
layer-norm copied to clipboard
Code and models from the paper "Layer Normalization"
Hi, From the paper "layer normalization", in section 3.1, layer normalization for rnn is used for the sum of the weighted input and weighted hidden. In the code of lngru,...
I got the following error while following step 4 in the training directory, using the pretrained model downloaded from the README file. ``` >>> import tools >>> embed_map = tools.load_googlenews_vectors()...
Hi Jimmy and Ryan, I found a discrepancy between the implementations in layers.py and torch_modules/LSTM_LN.lua. There are three LN applied in each LSTM step in layers.py while there remains only...