nanoGPT icon indicating copy to clipboard operation
nanoGPT copied to clipboard

Does the order of weight decay paramerters matter?

Open HongtaoYang opened this issue 1 year ago • 0 comments

In this line of the configure_optimizers method, the list of parameters are sorted. Just wondering does the order of params in a group matter?

HongtaoYang avatar Mar 22 '23 00:03 HongtaoYang