lsq-net
Why use fixed-step LR decay?
Thanks for your work! I have two questions. In the paper, the authors use a cosine LR schedule, so why is a fixed-step LR schedule used in your code? Also, the W3A2 accuracy has been reported, but I want to know whether you have trained W4A4 ResNet-18 to the accuracy reported in the paper.
Ahhh, I could not obtain better accuracy with cosine decay, so I used fixed-step decay instead.
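For anyone comparing the two schedules, here is a minimal sketch of fixed-step decay versus cosine decay. The function names and the hyperparameters (step size 30, gamma 0.1, 90 epochs) are illustrative assumptions, not the values used in this repo's configs:

```python
import math

def fixed_step_lr(base_lr, epoch, step_size=30, gamma=0.1):
    # Fixed-step decay: multiply the LR by gamma every step_size epochs
    # (the same behavior as torch.optim.lr_scheduler.StepLR).
    return base_lr * gamma ** (epoch // step_size)

def cosine_lr(base_lr, epoch, total_epochs=90):
    # Cosine annealing from base_lr down to 0 over total_epochs.
    return 0.5 * base_lr * (1 + math.cos(math.pi * epoch / total_epochs))
```

With these assumed hyperparameters, `fixed_step_lr(0.1, 30)` drops the LR to 0.01, while `cosine_lr(0.1, 45)` sits at 0.05 halfway through training.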
Also, the config.yaml is just a template. Please refer to the configuration files in the example folder.
I have tried W4A4, but its accuracy is lower than the authors' result.
Thanks for your reply. Another question: have you ever verified the effect of grad_scale? In my earlier quantization experiments (not LSQ), the weight LR was often much smaller than the scale LR, but grad_scale effectively just decays the LR of the scale parameter, which confuses me.
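For context, the LSQ paper defines the step-size gradient scale as g = 1/sqrt(N_W * Q_P), where N_W is the number of weights in the layer and Q_P is the positive quantization bound. A minimal sketch (the function name `lsq_grad_scale` and the example layer shape are mine, not from this repo):

```python
import math

def lsq_grad_scale(num_weights, bits, symmetric=True):
    # Positive quantization bound Q_P:
    #   signed (symmetric) weights: 2^(b-1) - 1
    #   unsigned activations:       2^b - 1
    qp = 2 ** (bits - 1) - 1 if symmetric else 2 ** bits - 1
    # g = 1 / sqrt(N_W * Q_P), applied to the step-size gradient.
    return 1.0 / math.sqrt(num_weights * qp)

# Example: a hypothetical 64x64 3x3 conv layer with 4-bit weights.
g = lsq_grad_scale(64 * 64 * 3 * 3, bits=4)
```

Because N_W is large for typical conv layers, g is small, so multiplying the step-size gradient by g does act much like shrinking the effective LR of the scale parameter relative to the weights, which is the behavior described above.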
Sorry, not yet.