Antonin RAFFIN issues

Results: 65 issues of Antonin RAFFIN

[Feature Request] Multi-Agent (MA) Support / Distributed algorithms (IMPALA/APEX)

Here is an issue to discuss multi-agent and distributed agent support. My personal view is that this should be done outside SB3 (even though it could use SB3...

enhancement

experimental

[Bug] Logger does not support same names with different namespaces

### 🐛 Bug I found this bug while working on #255. The logger will only log one value even when two namespaces are specified. ### To Reproduce ```python from...

bug

help wanted
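The reproduction script in the preview above is truncated, so the following is only a toy sketch of the kind of collision the issue title describes. It does not use the SB3 logger API and does not claim to show the actual cause. The helper names `render_flat` and `render_namespaced` are hypothetical, and the assumption is that keys have the form `namespace/name`.

```python
from typing import Dict


def render_flat(values: Dict[str, float]) -> Dict[str, float]:
    """Toy formatter (hypothetical) that drops the namespace before storing."""
    table: Dict[str, float] = {}
    for key, value in values.items():
        name = key.split("/", 1)[-1]  # "train/loss" -> "loss"
        table[name] = value  # a later namespace silently replaces an earlier one
    return table


def render_namespaced(values: Dict[str, float]) -> Dict[str, float]:
    """Toy formatter (hypothetical) that keeps the full 'namespace/name' key."""
    return dict(values)


# Same name ("loss") recorded under two different namespaces.
logged = {"train/loss": 0.5, "eval/loss": 0.7}

flat = render_flat(logged)        # only one "loss" entry survives
full = render_namespaced(logged)  # both entries are kept

print(flat)
print(full)
```

Keying on the full `namespace/name` string is one simple way to avoid losing a value; whether that matches the fix adopted for this issue is not stated in the preview.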

Generalized State-Dependent Exploration (gSDE)

**This PR is NOT meant to be merged** This branch contains the code for reproducing the results in the paper "Generalized State-Dependent Exploration for Deep Reinforcement Learning in Robotics" by...

Issue with multiple envs and determinism

Hello, Thanks for the project, it looks awesome. I've been trying to use Stable-Baselines3 on it (we created a fork to register the gym env: https://github.com/osigaud/rex-gym) and could train an...

WASM Example

Hello, First, thank you for this cool project. I was looking for the best/easiest approach to use torch in a browser (now that there is a C++ API, it should...

Noisy Cross Entropy Method (CEM)

## Description ## Context - [ ] I have raised an issue to propose this change ([required](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/blob/master/CONTRIBUTING.md)) ## Types of changes - [ ] Bug fix (non-breaking change which fixes...

Add support for Gym 0.24

Implement PPO MPI (SB2 PPO1)

MPI can be quite useful for exploiting the [full potential](https://twitter.com/hardmaru/status/1260852988475658242) of multiprocessing, but it is a dependency that can be tricky to install.

enhancement

help wanted

Implement D4PG

Distributed Distributional Deterministic Policy Gradients (D4PG). Reference implementation: - https://github.com/deepmind/acme PyTorch implementation: - https://github.com/fabiopardo/tonic

enhancement

help wanted

Implement MPO

[Maximum a Posteriori Policy Optimisation (MPO)](https://arxiv.org/abs/1806.06920) Reference implementation: - https://github.com/deepmind/acme PyTorch implementation: - https://github.com/fabiopardo/tonic

enhancement

help wanted