joelxiangnanchen

Results 3 issues of joelxiangnanchen

Hi, thx for your guys' great works. Is there any compressive transformers pre-trained model releasing plan? Want to see CT as backbone of GPT2 or BERT on long document like...

Hi, Thx for your great tutorial with nice guide and code. After I read decoder's code, I found that you just use lstm's hidden states to compute the next word's...

RT. thx for your nice work to enhance BERT with discourse awareness!