Optimisers.jl
Optimisers.jl defines many standard optimisers and utilities for learning loops.
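The typical pattern, sketched here on a plain NamedTuple of parameters (a toy stand-in for a real model), is to build optimiser state with `setup` and then apply a gradient with `update`:

```julia
using Optimisers

# Toy "model": any Functors-compatible structure of arrays works.
params = (w = [1.0, 2.0], b = [0.5])

# Build per-parameter optimiser state for plain gradient descent.
state = Optimisers.setup(Optimisers.Descent(0.1), params)

# A gradient with the same structure as the parameters.
grads = (w = [0.1, 0.2], b = [0.05])

# One step: returns updated state and updated parameters.
state, params = Optimisers.update(state, params, grads)
```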
### Motivation and description

In [other contexts](https://en.wikipedia.org/wiki/Elastic_net_regularization), combining L1 and L2 regularization can be reasonable. In Optimisers, they have the same parameter name, which, if I understand correctly, will mean...
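For reference, elastic-net-style regularisation can be expressed by chaining the two decay rules in front of a base rule. This sketch assumes `SignDecay` provides the L1 penalty and `WeightDecay` the L2 penalty, as in recent Optimisers.jl releases:

```julia
using Optimisers

# Assumed semantics: SignDecay adds λ·sign(x) (L1) and WeightDecay adds
# λ·x (L2) to the gradient before the base rule takes its step.
rule = OptimiserChain(SignDecay(1e-5), WeightDecay(1e-4), Descent(0.1))

params = (w = [2.0, -3.0],)
state = Optimisers.setup(rule, params)

# With a zero gradient, only the two penalties move the weights,
# shrinking them toward zero.
grads = (w = [0.0, 0.0],)
state, params = Optimisers.update(state, params, grads)
```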
### Motivation and description

Can we implement L-BFGS? It is a quasi-second-order method that can converge much faster, suitable for computationally intensive models with a moderate number of parameters. I...
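For context, the heart of L-BFGS is the two-loop recursion, which applies an approximate inverse Hessian built from a short history of steps and gradient differences. The helper below is hypothetical, not part of Optimisers.jl, and only sketches the direction computation:

```julia
using LinearAlgebra

# Hypothetical helper, NOT part of Optimisers.jl.
# `s_hist[i]` stores a past step x_{k+1} - x_k, and `y_hist[i]` the matching
# gradient difference g_{k+1} - g_k, oldest first.
function lbfgs_direction(g, s_hist, y_hist)
    m = length(s_hist)
    q = copy(g)
    ρ = [1 / dot(y_hist[i], s_hist[i]) for i in 1:m]
    α = zeros(m)
    for i in m:-1:1                      # backward pass over the history
        α[i] = ρ[i] * dot(s_hist[i], q)
        q .-= α[i] .* y_hist[i]
    end
    # Initial Hessian scaling from the most recent pair (1.0 with no history).
    γ = m == 0 ? 1.0 : dot(s_hist[m], y_hist[m]) / dot(y_hist[m], y_hist[m])
    z = γ .* q
    for i in 1:m                         # forward pass
        β = ρ[i] * dot(y_hist[i], z)
        z .+= (α[i] - β) .* s_hist[i]
    end
    return -z                            # quasi-Newton descent direction
end
```

With an empty history this reduces to plain steepest descent, which is why L-BFGS needs only O(m·n) memory rather than an n×n Hessian.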
```julia
julia> using Optimisers

julia> mutable struct Two{T}; x::T; y::T; Two(x::T) where T = new{T}(x) end

julia> Optimisers.trainable(z::Two) = (; z.x)

julia> t = Two([1,2,3.])
Two{Vector{Float64}}([1.0, 2.0, 3.0], #undef)

julia>...
```
but rather assume that the gradient has already been accumulated. See https://github.com/FluxML/Optimisers.jl/pull/192/files#r1835058503
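A minimal sketch of what that contract implies for a training loop: sum the per-batch gradients yourself, then call `update` once with the total (the names and batch values here are illustrative):

```julia
using Optimisers

params = (w = [1.0, 2.0],)
state = Optimisers.setup(Optimisers.Descent(0.1), params)

# Gradients from two micro-batches; the rule does no averaging, so we
# accumulate (sum) them ourselves before taking a single step.
g1 = (w = [0.1, 0.0],)
g2 = (w = [0.3, 0.2],)
total = (w = g1.w .+ g2.w,)

state, params = Optimisers.update(state, params, total)
```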
This package is quite stable; we could tag a v1.0.0 release.