transformer icon indicating copy to clipboard operation
transformer copied to clipboard

How to improve the result? (already add beam search)

Open trx14 opened this issue 5 years ago • 3 comments

I want to improve transformer model. I think tensor2tensor is too big to change, so I choose this code. First, I have to reproduce the previous result. I add beam search by myself(before adding the beam search, the Bleu result was 28.34) and train model on iwslt14 de-en database(many papers use this database, not the iwslt16 de-en). I set all hyper parameter same as the base model in original paper. But I only get 30.34 Bleu, however many papers said they could get 32.86. Did some one know how to improve the result? Or did some other public transformer projects easy to change?

trx14 avatar May 08 '19 06:05 trx14

Hi, I am also trying to add the beam search function. Would you mind sharing how you implemented the beam search part?

yaoyiran avatar Jun 04 '19 16:06 yaoyiran

Hello! Would you mind sharing how you implemented the beam search part? I want to learn it.

ZhichaoOuyang avatar Mar 01 '20 15:03 ZhichaoOuyang

I want to improve transformer model. I think tensor2tensor is too big to change, so I choose this code. First, I have to reproduce the previous result. I add beam search by myself(before adding the beam search, the Bleu result was 28.34) and train model on iwslt14 de-en database(many papers use this database, not the iwslt16 de-en). I set all hyper parameter same as the base model in original paper. But I only get 30.34 Bleu, however many papers said they could get 32.86. Did some one know how to improve the result? Or did some other public transformer projects easy to change?

Hello! Would you mind sharing how you implemented the beam search part?

ZhichaoOuyang avatar Mar 02 '20 09:03 ZhichaoOuyang