Antonin RAFFIN
Antonin RAFFIN
Here is an issue to discuss about multi-agent and distributed agent support. My personal view on that is this should be done outside SB3 (even though it could use SB3...
### 🐛 Bug I found that bug while working on #255 . The logger will only log one value even when two namespaces are specified. ### To Reproduce ```python from...
**This PR is NOT meant to be merged** This branch contains the code for reproducing the results in the paper "Generalized State-Dependent Exploration for Deep Reinforcement Learning in Robotics" by...
Hello, Thanks for the project, it looks awesome. I've been trying to use Stable-Baselines3 on it (we created a fork to register the gym env: https://github.com/osigaud/rex-gym) and could train an...
WASM Example
Hello, First, thank you for this cool project. I was looking for the best/easiest approach to use torch in a browser (now that there is a C++ api, it should...
## Description ## Context - [ ] I have raised an issue to propose this change ([required](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/blob/master/CONTRIBUTING.md)) ## Types of changes - [ ] Bug fix (non-breaking change which fixes...
## Description ## Context - [ ] I have raised an issue to propose this change ([required](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/blob/master/CONTRIBUTING.md)) ## Types of changes - [ ] Bug fix (non-breaking change which fixes...
MPI can be quite useful to use multiprocessing [full potential](https://twitter.com/hardmaru/status/1260852988475658242) but it is dependency that can be tricky to install.
[Distributed Distributional Deterministic Policy Gradients](D4PG) Reference implementation: - https://github.com/deepmind/acme PyTorch implementation: - https://github.com/fabiopardo/tonic
[Maximum a Posteriori Policy Optimisation (MPO)](https://arxiv.org/abs/1806.06920) Reference implementation: - https://github.com/deepmind/acme PyTorch implementation: - https://github.com/fabiopardo/tonic