pointer-network-tensorflow
pointer-network-tensorflow copied to clipboard
question about model.py
Hi,
I am reading the pointer-network codes. However, I have a question in model.py file. In model.py, line 133-134, you gathered enc_output into dec_inputs. But in the original paper, it seems that the decoder inputs should be from enc_inputs.
133 self.embeded_dec_inputs = tf.stop_gradient( 134 tf.gather_nd(self.enc_outputs, self.idx_pairs))
Could you explain this a little bit?
Best, Zhe
Thanks! I've never seen that but you're right. I made a mistake on that and the model should be trained with the ground truth input but the current code does not.
Yes. That is what I think. I use the enc_input to train and it seems that the model converged faster. But it is interesting that using enc_output the model also works well.