Learning rate mentioned in paper vs run_summarization.py

Open s4sarath opened this issue 4 years ago • 0 comments

Hi ,

The learning rate mentioned in paper for summarization is around 3e-5 . But in the run_summarization.py it is mentioned as 0.32 ( default ) in the flags. In roberta_base.sh script, there is no changing happen for the learning rate.

Can anyone please update on this, as learning rate is very crucial for models like these.

Thanks

Aug 27 '21 11:08 s4sarath