Optimisers.jl
Optimisers.jl defines many standard optimisers and utilities for learning loops.
This was discovered in https://github.com/SciML/NeuralPDE.jl/issues/533 as an issue that only showed itself as an incorrect gradient: the primal pass of what was being trained was in Float64, the reverse passes...
Through a typo, I found that Base exports [`iswritable`](https://docs.julialang.org/en/v1/base/io-network/#Base.iswritable) (one char difference). Were we to outsource this, possible candidates include [`ChainRulesCore.is_inplaceable_destination`](https://juliadiff.org/ChainRulesCore.jl/stable/api.html#ChainRulesCore.is_inplaceable_destination) (used by `add!!`) and [`ArrayInterfaceCore.ismutable`](https://juliaarrays.github.io/ArrayInterface.jl/dev/api/#ArrayInterfaceCore.ismutable).
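A rough sketch of how such a predicate might gate in-place versus out-of-place updates (the names `canwrite` and `subtract!!` here are hypothetical stand-ins, not part of the package; the chosen check could be either of the candidates above):

```julia
# Hypothetical predicate: true only for arrays we know are safe to mutate.
canwrite(::DenseArray{<:AbstractFloat}) = true
canwrite(_) = false

# Sketch of an update step that mutates when allowed, copies otherwise.
function subtract!!(x, dx)
    if canwrite(x)
        x .-= dx       # mutate in place
    else
        x .- dx        # immutable leaf: return a new value
    end
end
```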
It seems like Optimisers only works with vectors.

```julia
using Functors

struct WithFloat
    val::Float64
end
@functor WithFloat
```

The call to `Optimisers.trainable(WithFloat(4.3))` returns `(val = 4.3,)`, but `destructure(WithFloat(4.3))` produces an...
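One possible workaround, sketched here only (the `WithVec` type is hypothetical, not an official recommendation), is to hold the scalar in a one-element vector so that `destructure` and `update!` see a mutable array leaf:

```julia
using Functors, Optimisers

# Hypothetical variant of the struct above: a 1-element vector stands in for the scalar.
struct WithVec
    val::Vector{Float64}
end
@functor WithVec

m = WithVec([4.3])
flat, re = Optimisers.destructure(m)   # flat == [4.3]; re(flat) rebuilds a WithVec
```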
We should give more prominence in the docs to using `Flux.@functor` and `Optimisers.trainable` to define the trainable parameters of custom types.
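The pattern the docs could highlight looks roughly like this (the `Affine` type is only illustrative):

```julia
using Functors, Optimisers

struct Affine
    W
    b
    activation          # not a parameter
end
@functor Affine

# Restrict optimisation to W and b; `activation` is ignored by setup/update!.
Optimisers.trainable(a::Affine) = (; W = a.W, b = a.b)
```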
In optimisers like AdamW, the learning rate and the weight decay are often tweaked, but the momentum decay values are not (see [PyTorch](https://pytorch.org/docs/stable/generated/torch.optim.AdamW.html), for example,...
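A sketch of that use case with the keyword form of `Optimisers.adjust!`, assuming the AdamW hyperparameters are exposed under the names `eta`, `beta` and `lambda`:

```julia
using Optimisers

model = (W = rand(3, 3), b = zeros(3))           # any Functors-compatible model
state = Optimisers.setup(Optimisers.AdamW(), model)

# Later in training: change the learning rate and weight decay,
# leaving the momentum decays (beta) untouched.
Optimisers.adjust!(state; eta = 1e-4, lambda = 1e-2)
```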
```julia
using Flux
using Functors
using Optimisers

struct Custom
    abc::Tuple
end

Functors.@functor Custom (abc,)

function (f::Custom)(x)
    x .* f.abc[1] .+ f.abc[2]
end

function Custom(; dim::Int)
    abc = (randn(Float32, dim), randn(Float32, dim))
    ...
```
This is simpler than the version in https://github.com/FluxML/Flux.jl/pull/969, as it has no special handling for momentum (not yet, at least). It's unusual in that I think it needs to be written...
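For reference, a rough sketch of a custom rule under this state-in / state-out design, following the documented `init`/`apply!` pattern (the `MyDescent` rule itself is just a toy):

```julia
using Optimisers

# A toy rule: plain gradient descent written as a pure rule.
struct MyDescent <: Optimisers.AbstractRule
    eta::Float64
end

Optimisers.init(o::MyDescent, x::AbstractArray) = nothing   # no per-parameter state

function Optimisers.apply!(o::MyDescent, state, x, dx)
    # Return the (unchanged) state and the step that update! will subtract.
    return state, o.eta .* dx
end
```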
Flux has model structs, and Zygote would return NamedTuple gradients for them. With FluxML/Functors.jl#1 we add the ability to handle gradients in Functors.jl - in other words, to do a "zipped"...
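A sketch of such a zipped walk using multi-argument `fmap`, assuming a NamedTuple model and a matching NamedTuple gradient:

```julia
using Functors

model = (W = [1.0 2.0; 3.0 4.0], b = [0.0, 0.0])
grad  = (W = [0.1 0.1; 0.1 0.1], b = [0.5, 0.5])

# Walk model and gradient together, pairing each parameter leaf
# with its corresponding gradient leaf.
updated = fmap((x, dx) -> x .- 0.01 .* dx, model, grad)
```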