joelxiangnanchen comments

joelxiangnanchen

Prediction linear layer's input is just the hidden state, but the original paper combines it as [L(word_embed + Wh + Uc)]

@fawazsammani Hi, in the original paper, the authors feed the linearly transformed current hidden state, the previous word embedding, and the current context vector into the prediction layer, as in the equation above. The LSTM's input is the last time step's...
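
For reference, here is a minimal, dependency-free sketch of the combination written in the title, L(word_embed + Wh + Uc). The names W, U and L come from that expression. The matrix shapes, the final softmax, the helper names (`matvec`, `softmax`, `predict_word_probs`) and the toy numbers are assumptions made for illustration only; they are not taken from the repository or the paper.

```python
import math


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict_word_probs(word_embed, hidden, context, W, U, L):
    """Return softmax(L (word_embed + W hidden + U context)).

    Assumed (hypothetical) shapes:
      word_embed: previous word embedding, length m
      hidden:     current hidden state, length n
      context:    current context vector, length d
      W: m x n, U: m x d, L: vocab_size x m
    """
    projected_hidden = matvec(W, hidden)    # Wh, projected to embedding size
    projected_context = matvec(U, context)  # Uc, projected to embedding size
    combined = [
        e + h + c
        for e, h, c in zip(word_embed, projected_hidden, projected_context)
    ]
    return softmax(matvec(L, combined))


# Toy usage with made-up numbers (m=2, n=3, d=2, vocab_size=3).
probs = predict_word_probs(
    word_embed=[0.1, -0.2],
    hidden=[0.5, 0.0, -0.5],
    context=[1.0, 2.0],
    W=[[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],
    U=[[0.5, 0.0], [0.0, 0.5]],
    L=[[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
)
```

By contrast, a prediction layer that sees only the hidden state would compute `softmax(matvec(L, hidden))` (with L sized to match), leaving out the word embedding and context terms.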