Leo YU

Results 3 issues of Leo YU

The paper said: "Taking inspiration from the input and forget gates in LSTM, we decompose each write into two parts: an erase followed by an add". Why? Thanks!

Hi, I try to do DSSM experiment recently, however, I stucked in your code. Would you plz help me give example data to run success? Thanks a lot!