Time-LLM icon indicating copy to clipboard operation
Time-LLM copied to clipboard

accelerator.prepare() with scheduler disturbe the learning rate

Open ztb-35 opened this issue 7 months ago • 0 comments

Hi, guys: Recently, I reproduce this paper. But, I found an interesting thing. The learning rate changes from 1e-3 to 4e-5 at the first step in the first epoch, and stays till the end. I am not sure if it is a bug with accelerator.prepare() or not. Any discussion is appreciated! Here is how I print the learning rate. Screenshot 2024-07-22 at 8 08 46 PM It returns 1e-3 as my setting before training. Screenshot 2024-07-22 at 8 10 08 PM During training, I get the learning rate always as 4e-5.


I also try to change the args.lradj, I tried 'type1', 'constant', and 'COS'. It is always like above said. Screenshot 2024-07-22 at 9 16 53 PM

ztb-35 avatar Jul 22 '24 12:07 ztb-35