bert-stable-fine-tuning
Adam Epsilon Choice
Hi authors,
Thank you for your excellent work! I noticed a discrepancy in the Adam epsilon: the paper states 1e-6, while the example scripts in this repo default to 1e-8, and all the instructions in the examples dir rely on that default of 1e-8.
Do you have any recommendations for which value to use?
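To make the question concrete, here is a minimal sketch of where epsilon enters a standard Adam update. It is not taken from the repo's code: the function name `adam_step_size`, the learning rate of 2e-5, and the beta values are assumptions chosen only for illustration. It computes the size of the first update for a single scalar gradient, so the two epsilon values can be compared side by side.

```python
import math


def adam_step_size(grad, eps, lr=2e-5, beta1=0.9, beta2=0.999, step=1):
    """Size of one Adam update for a scalar gradient, starting from zero moments.

    Hypothetical helper for illustration only; lr and the betas are assumed values.
    """
    # Exponential moving averages of the gradient and the squared gradient.
    m = (1 - beta1) * grad
    v = (1 - beta2) * grad * grad
    # Bias correction, as in the standard Adam formulation.
    m_hat = m / (1 - beta1 ** step)
    v_hat = v / (1 - beta2 ** step)
    # Epsilon is added to the denominator, so it matters most for small gradients.
    return lr * m_hat / (math.sqrt(v_hat) + eps)


if __name__ == "__main__":
    for grad in (1e-2, 1e-6, 1e-8):
        paper_value = adam_step_size(grad, eps=1e-6)
        script_default = adam_step_size(grad, eps=1e-8)
        print(f"grad={grad:.0e}  eps=1e-6: {paper_value:.3e}  eps=1e-8: {script_default:.3e}")
```

In this sketch the two settings give nearly the same step for large gradients, but the larger epsilon shrinks the step once the gradient magnitude is comparable to epsilon, which is why I would like to know which value was intended.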
Thank you so much!

