JamesHeald

3 issues by JamesHeald

sb3 already has recurrent (LSTM) policy options (e.g. [RecurrentPPO](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/blob/master/sb3_contrib/ppo_recurrent/ppo_recurrent.py) in sb3-contrib). It would be great to have recurrent PPO implemented for sbx as well.

enhancement
help wanted

I have implemented a diffusion probabilistic model for the first time using a cosine noise schedule, and I am finding that the variance/standard deviation of samples drawn from the reverse...

I have created a basic environment for reaching to random targets in space with the MyoSuite arm.

- I created a new myoarm_reach.xml file by modifying the existing myoarm_relocate.xml file...