NRI
Some differences from the paper
Dear ethanfetaya:
I have been studying the code of RNNDecoder and found some differences from equations (14)-(16) in your paper. In your code, MSG and x are not concatenated as the input to the GRU, and there is no additional hidden state. Why is that? Which version is correct?
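To make the question concrete, here is a minimal sketch of how I read the update in the paper: a standard GRU step whose input is the concatenation of MSG and x, with a separate hidden state carried between time steps. This is only my interpretation, not code from the repository. The names `gru_step` and `make_gru_params`, the sizes, and the random weights are all hypothetical, and it uses plain Python lists rather than PyTorch so it runs on its own.

```python
import math
import random


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))


def make_gru_params(input_size, hidden_size, seed=0):
    """Hypothetical small random weights; biases omitted for brevity."""
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

    params = {}
    for gate in ("r", "z", "n"):
        params["W_" + gate] = mat(hidden_size, input_size)   # input weights
        params["U_" + gate] = mat(hidden_size, hidden_size)  # hidden weights
    return params


def gru_step(inp, h, p):
    """One standard GRU step: returns the next hidden state."""
    r = [sigmoid(a + b) for a, b in zip(matvec(p["W_r"], inp), matvec(p["U_r"], h))]
    z = [sigmoid(a + b) for a, b in zip(matvec(p["W_z"], inp), matvec(p["U_z"], h))]
    n = [math.tanh(a + ri * b)
         for a, ri, b in zip(matvec(p["W_n"], inp), r, matvec(p["U_n"], h))]
    # Blend the candidate state n with the previous hidden state h.
    return [(1.0 - zi) * ni + zi * hi for zi, ni, hi in zip(z, n, h)]


# My reading of the paper: GRU input is [MSG, x], hidden state is separate.
msg = [0.5, -0.2, 0.1]      # aggregated message for one node (made-up values)
x = [1.0, 2.0]              # current observation for that node (made-up values)
h_prev = [0.0, 0.0, 0.0]    # separate recurrent hidden state

params = make_gru_params(input_size=len(msg) + len(x), hidden_size=len(h_prev))
h_next = gru_step(msg + x, h_prev, params)  # list concatenation = [MSG, x]
print(h_next)
```

In this reading, MSG only enters through the input weights and `h_prev` is a distinct quantity that persists across time steps. That is the part I cannot match to the RNNDecoder code.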