Steven Morad issues

Repositories
Issues
Comments

Results 22 issues of


Steven Morad

[Feature Request] Normalized gradient descent

Optax has various clipping operators, but as far as I can tell, it cannot scale by gradient norm. Adding these capabilities such that they could be chained would allow us...

enhancement

[FEATURE] Add support for efficient recurrent models

### Feature [Revisiting Recurrent Reinforcement Learning with Memory Monoids](https://arxiv.org/abs/2402.09900) provides a method to combine recurrent models with standard, nonrecurrent RL losses. This should provide support for S5, LRU, FFM, Linear...

enhancement

Roadmap