
Allow keyword arguments for optimisers

Open · theabhirath opened this issue 3 years ago · 1 comment

In optimisers like AdamW, the learning rate and the weight decay are often tweaked while the momentum decay values are left at their defaults (see PyTorch, for example, where the weight decay can be specified without having to specify the β values). Could keyword arguments be allowed here, so that only the values being changed need to be passed?
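For instance, with only a positional constructor, changing the weight decay alone still forces you to restate the β defaults. A toy sketch of the status quo (AdamWToy is an illustrative stand-in, not Optimisers.jl's actual definition):

julia> struct AdamWToy{T}
         eta::T
         beta::Tuple{T,T}
         lambda::T
       end

julia> AdamWToy(eta = 1f-3, beta = (9f-1, 9.99f-1), lambda = 0f0) =
         AdamWToy{typeof(eta)}(eta, beta, lambda);

julia> AdamWToy(1f-3, (9f-1, 9.99f-1), 1f-4)  # must repeat the βs just to set λ
AdamWToy{Float32}(0.001f0, (0.9f0, 0.999f0), 0.0001f0)

The request is for something like AdamWToy(lambda = 1f-4) to work, leaving eta and beta at their defaults.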

theabhirath avatar May 09 '22 10:05 theabhirath

One option would be to add Base.@kwdef:

julia> Base.@kwdef struct Nesterov{T}
         eta::T = 1f-3
         rho::T = 9f-1
       end

julia> Nesterov(rho = 0.9)  # @kwdef forwards to Nesterov(0.001f0, 0.9): mixed types
ERROR: MethodError: no method matching Nesterov(::Float32, ::Float64)

julia> Nesterov(η = 1f-3, ρ = 9f-1) = Nesterov{typeof(η)}(η, ρ);  # promotes everything to η's type

julia> Nesterov(rho = 0.9)
Nesterov{Float32}(0.001f0, 0.9f0)

One ugly feature is that you have to type the defaults twice; is there a neat way around that?
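One possible pattern (a sketch only, using a fresh name Nesterov2 so it doesn't clash with the definition above): write the defaults once, in a NamedTuple, and merge the keywords over it:

julia> struct Nesterov2{T}
         eta::T
         rho::T
       end

julia> const DEFAULTS = (eta = 1f-3, rho = 9f-1);

julia> function Nesterov2(; kw...)
         nt = merge(DEFAULTS, kw)  # keywords override the stored defaults
         Nesterov2{typeof(nt.eta)}(nt.eta, oftype(nt.eta, nt.rho))
       end;

julia> Nesterov2(rho = 0.9)
Nesterov2{Float32}(0.001f0, 0.9f0)

The defaults then live in one place, at the cost that misspelled keywords are silently absorbed by the merge rather than throwing an error.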

The other problem, an unavoidable one, is that all these names become public API & need to be documented.

mcabbott avatar May 09 '22 13:05 mcabbott