DualRL how to tune hyper-parameters to balance between sentiment accuracy and BLEU score

Open jind11 opened this issue 5 years ago • 3 comments

Hi, I re-ran your code without changing any hyper-parameters and got this result: 0-1_Test(Batch:1600) Senti:77.300 BLEU(4ref):61.125(A:55.520+B:66.730) G-score:68.738 H-score:68.267 Cost time:2.57. Could you share some experience on how to tune the hyper-parameters so that I can balance sentiment accuracy against BLEU score? Thank you so much!
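For readers comparing their own logs, the G-score and H-score in the line above are consistent with the geometric and harmonic means of the sentiment accuracy and the BLEU score. The sketch below only checks that arithmetic; the function names are hypothetical and are not taken from the repository.

```python
import math


def g_score(senti: float, bleu: float) -> float:
    """Geometric mean of sentiment accuracy and BLEU (assumed definition)."""
    return math.sqrt(senti * bleu)


def h_score(senti: float, bleu: float) -> float:
    """Harmonic mean of sentiment accuracy and BLEU (assumed definition)."""
    return 2 * senti * bleu / (senti + bleu)


# Values copied from the log line quoted in the question.
senti, bleu = 77.300, 61.125
print(f"G-score: {g_score(senti, bleu):.3f}")
print(f"H-score: {h_score(senti, bleu):.3f}")
```

Both means drop when either metric drops, and the harmonic mean penalizes an imbalance between the two more strongly, which is why it is the lower of the two numbers in the log.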

Oct 02 '19 14:10 jind11

You can change 0.25 to a larger value, which should give better sentiment accuracy at the cost of worse content preservation.

https://github.com/luofuli/DualRL/blob/2fae5bb41e62a2c1c8bd2d439baba01c9d8e4f21/dual_training.py#L275
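The expression at the linked line is not reproduced in this thread. As a rough illustration of why a larger coefficient would favor sentiment accuracy, the sketch below assumes the coefficient acts as the weight in a weighted harmonic mean of a style reward and a content reward. That form, and the names `combined_reward`, `style_reward`, `content_reward`, and `beta`, are assumptions for illustration, not the repository's code.

```python
def combined_reward(style_reward: float, content_reward: float,
                    beta: float = 0.25) -> float:
    """Weighted harmonic mean of two rewards (assumed form, hypothetical names).

    As beta grows, the result moves toward style_reward;
    as beta shrinks toward 0, it moves toward content_reward.
    """
    denom = style_reward + beta * content_reward
    if denom == 0:
        return 0.0
    return (1 + beta) * style_reward * content_reward / denom


# A sample with weak style transfer but well-preserved content.
style, content = 0.5, 0.9
for beta in (0.25, 1.0, 4.0):
    print(beta, round(combined_reward(style, content, beta), 4))
```

Under this assumed form, raising the coefficient makes the combined reward track the style reward more closely, so good content preservation does less to offset a weak style reward.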

Note: Each printed log line shows the results on just one test set: 0-1_Test... is the performance on test.0 and 1-0_Test... is the performance on test.1.

Oct 02 '19 15:10 luofuli

I thought that in the result log, A:55.520+B:66.730, A and B correspond to the two directions, 0->1 and 1->0, and that the BLEU score is their average. Am I right?
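The averaging part of this question can at least be checked against the numbers in the log. The short check below shows that the reported BLEU matches the arithmetic mean of A and B. It says nothing about what A and B denote, which this thread does not confirm.

```python
# Values copied from the log line: BLEU(4ref):61.125(A:55.520+B:66.730)
bleu_a, bleu_b = 55.520, 66.730
reported_bleu = 61.125

bleu_avg = (bleu_a + bleu_b) / 2
print(f"mean of A and B: {bleu_avg:.3f}, reported: {reported_bleu:.3f}")
```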

Oct 03 '19 18:10 jind11

Hi Fuli, I tried increasing the context reward coefficient from 0.25 to 1.0, and the highest sentiment accuracy I got was 78.7%. Should I increase this coefficient further, or should I tune other hyper-parameters to bring the sentiment accuracy up to what is reported in the paper? Thank you so much!

Oct 06 '19 17:10 jind11
