dfdx
Add Weight Decay to optimizers
- [ ] Sgd: `weight_decay: Option<f32>`
- [ ] RMSprop: `weight_decay: Option<f32>`
- [ ] Adam/AdamW: `weight_decay: Option<AdamDecay>`, where `enum AdamDecay { Vanilla(f32), AdamW(f32) }` (maybe use different terms for `AdamDecay`).
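
A rough sketch of how these fields could behave, on plain `f32` scalars rather than tensors. Only `weight_decay`, `AdamDecay`, `Vanilla` and `AdamW` come from the checklist above; `sgd_step`, `adam_step`, `AdamState` and the hyperparameter arguments are hypothetical names for illustration, not existing dfdx API. It also assumes `Vanilla` means classic L2 decay (added to the gradient before the moment estimates) and `AdamW` means decoupled decay (applied directly to the parameter).

```rust
/// The enum proposed in the checklist. The meaning given to each variant
/// here is an assumption, not something the issue spells out.
#[derive(Debug, Clone, Copy, PartialEq)]
pub enum AdamDecay {
    /// Assumed: L2 penalty, folded into the gradient.
    Vanilla(f32),
    /// Assumed: decoupled decay, applied to the parameter itself.
    AdamW(f32),
}

/// Hypothetical scalar SGD step with optional L2 weight decay.
pub fn sgd_step(param: f32, grad: f32, lr: f32, weight_decay: Option<f32>) -> f32 {
    // With decay, the effective gradient is g + wd * p.
    let g = match weight_decay {
        Some(wd) => grad + wd * param,
        None => grad,
    };
    param - lr * g
}

/// Hypothetical per-parameter Adam state (first/second moments, step count).
#[derive(Debug, Default, Clone, Copy)]
pub struct AdamState {
    m: f32,
    v: f32,
    t: i32,
}

/// Hypothetical scalar Adam step supporting both decay flavours.
pub fn adam_step(
    param: f32,
    grad: f32,
    state: &mut AdamState,
    lr: f32,
    betas: (f32, f32),
    eps: f32,
    weight_decay: Option<AdamDecay>,
) -> f32 {
    // Vanilla decay changes the gradient that feeds the moment estimates.
    let g = match weight_decay {
        Some(AdamDecay::Vanilla(wd)) => grad + wd * param,
        _ => grad,
    };

    state.t += 1;
    state.m = betas.0 * state.m + (1.0 - betas.0) * g;
    state.v = betas.1 * state.v + (1.0 - betas.1) * g * g;

    // Bias-corrected moment estimates.
    let m_hat = state.m / (1.0 - betas.0.powi(state.t));
    let v_hat = state.v / (1.0 - betas.1.powi(state.t));

    let mut update = lr * m_hat / (v_hat.sqrt() + eps);

    // AdamW decay bypasses the moments and shrinks the parameter directly.
    if let Some(AdamDecay::AdamW(wd)) = weight_decay {
        update += lr * wd * param;
    }
    param - update
}
```

With a zero gradient the two variants already diverge on the first step: `AdamW(wd)` shrinks the parameter by `lr * wd * param`, while `Vanilla(wd)` routes the decay through Adam's normalisation, so the step size is close to `lr` regardless of `wd`. That difference is the reason for making the choice explicit in the type rather than using a bare `Option<f32>`.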