
ConvRNN Cell and ConvLSTM Cell

iblislin opened this issue 8 years ago · 5 comments

There is an implementation in MXNet: https://github.com/apache/incubator-mxnet/pull/6832

iblislin avatar Oct 18 '17 16:10 iblislin

Our LSTM implementation might be general enough that you can just throw a convolution into it. That would be a nice bit of reuse.

MikeInnes avatar Oct 26 '17 09:10 MikeInnes

This is not true of the current LSTM implementation, but it could be done for Recur. Is this something people use, or can we close this as a non-priority?

darsnack avatar Feb 12 '21 01:02 darsnack

Definitely something that people use. I need a quick implementation of this for an image-denoising application in a Julia codebase and would love to see it in Flux :). For now, PyCall + PyTorch seems like my best bet (e.g. https://towardsdatascience.com/video-prediction-using-convlstm-with-pytorch-lightning-27b195fd21a2).

tbenst avatar Apr 28 '21 03:04 tbenst

At the very least, I think it would be a good contribution to the model zoo. Whether ConvLSTM should be a core layer would require more discussion (do we also add ConvRNN? ConvGRU? Is it possible to generalize the cells so they work with both conv and matmul? etc.)

ToucheSir avatar Apr 28 '21 03:04 ToucheSir

Following up on this, Flax does an interesting thing by using Dense layers instead of explicitly writing out the matmuls: https://github.com/google/flax/blob/main/flax/linen/recurrent.py#L113-L118. My understanding is that the ConvLSTM paper simply subs out those matmuls for convolutions, so it may be possible to parameterize cells in terms of sublayer(s) that compute the gate pre-activations from x or h; see the sketch below.
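For illustration, a minimal sketch of that idea (`GenericLSTMCell` is a hypothetical name, not Flux API, and the gate ordering is an assumption): the cell only knows that its two sublayers map x and h to four stacked gate pre-activations, so passing Dense sublayers recovers a plain LSTM cell and passing Conv sublayers gives ConvLSTM (Shi et al. 2015, without peepholes).

```julia
using Flux

# Hypothetical sketch, not actual Flux API: an LSTM-style cell parameterized
# by the sublayers that transform the input x and hidden state h into the
# four stacked gate pre-activations.
struct GenericLSTMCell{I,H}
    Wx::I  # x -> 4 * hidden features/channels
    Wh::H  # h -> 4 * hidden features/channels (bias off to avoid doubling)
end

Flux.@functor GenericLSTMCell

function (m::GenericLSTMCell)((h, c), x)
    gates = m.Wx(x) .+ m.Wh(h)
    # Split along the feature dim: dim 1 for Dense outputs, channel dim for Conv.
    i, f, g, o = Flux.chunk(gates, 4; dims = ndims(gates) - 1)
    c′ = σ.(f) .* c .+ σ.(i) .* tanh.(g)
    h′ = σ.(o) .* tanh.(c′)
    return (h′, c′), h′
end

# Dense version ≈ a standard LSTM cell with 16 inputs, 32 hidden features
dense_cell = GenericLSTMCell(Dense(16 => 4 * 32), Dense(32 => 4 * 32; bias = false))

# Conv version ≈ ConvLSTM with 3×3 kernels, 16 input / 32 hidden channels
conv_cell = GenericLSTMCell(
    Conv((3, 3), 16 => 4 * 32; pad = SamePad()),
    Conv((3, 3), 32 => 4 * 32; pad = SamePad(), bias = false),
)
```

Wrapping such a cell in Recur (as mentioned above) would then give the stateful recurrent layer.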

ToucheSir avatar Jul 23 '21 18:07 ToucheSir