stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Results 192 stable-baselines3 issues

### ❓ Question I copied the code from the Examples section in the documentation, which also uses a PongNoFrameskip-v4 environment with 4 stacked frames. The episodic mean reward starts out...

question

### ❓ Question PPO and A2C run fine, but after switching to DDPG or SAC the code fails: once the first episode ends, the error below is raised. Why is my code generating this error? ```python Traceback (most recent call last): File "D:\ps\anaconda\envs\metro-env1\lib\code.py", line 90, in runcode exec(code, self.locals) File "", line 1, in File...

custom gym env
check the checklist

Connected to Issue https://github.com/DLR-RM/stable-baselines3/issues/1953 ## Description RunningMeanStd is made robust to overflows with two modifications: - the product that can produce overflows when `count` becomes too large is split into...

### 🐛 Bug RunningMeanStd is not overflow safe, and overflows when running large-scale training (e.g., on a cluster). ### To Reproduce I'm submitting a pull request with a proposal to...

bug
check the checklist
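
For readers unfamiliar with the statistic involved, here is a minimal sketch of merging two batches of running moments. It is not the code from the pull request, whose description is truncated above, and `merge_moments` is a hypothetical name used only for illustration. It assumes the usual parallel-variance formula and shows one way to avoid forming the product of the two counts directly.

```python
import statistics


def merge_moments(mean_a, var_a, count_a, mean_b, var_b, count_b):
    """Merge two (mean, population variance, count) summaries into one.

    Hypothetical helper for illustration; not part of stable-baselines3.
    """
    total = count_a + count_b
    delta = mean_b - mean_a
    # Share of the merged stream contributed by the new batch, always in [0, 1].
    weight_b = count_b / total
    new_mean = mean_a + delta * weight_b
    # The cross term is written with count_a * weight_b instead of
    # (count_a * count_b) / total, so the two counts are never multiplied together.
    m2 = var_a * count_a + var_b * count_b + delta * delta * count_a * weight_b
    return new_mean, m2 / total, total


# Usage: merging the summaries of two batches matches the statistics
# computed over the concatenated data.
a = [1.0, 2.0, 3.0, 4.0]
b = [10.0, 20.0]
mean, var, count = merge_moments(
    statistics.fmean(a), statistics.pvariance(a), len(a),
    statistics.fmean(b), statistics.pvariance(b), len(b),
)
```

Whether this rearrangement corresponds to the change proposed in the pull request cannot be determined from the truncated description.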

### ❓ Question Hi, I'm trying to run PPO but whenever I try to use StopTrainingOnMaxEpisodes, it gives me the assertion error: AssertionError: `dones` variable is not defined, please check...

question
more information needed
check the checklist

### ❓ Question I have a question about the optimizer initialization process in `DQNPolicy`. While working on a custom DQN model, I noticed that when creating the optimizer, we pass...

enhancement
good first issue
help wanted
question

### ❓ Question I am trying to parallelise execution of PPO training on MuJoCo environments, where each multiprocessing thread uses a slightly modified xml file to train PPO with. For...

question

### 🐛 Bug When I use the wandb integration, no warning or error is reported, but no metrics are logged on the wandb website. I tried with other framework,...

bug
custom gym env
check the checklist

### 🚀 Feature Add support for multi-variable logging in the logger module using add_scalars. Enable the logger to record and visualize multiple related scalar values simultaneously using a single record...

enhancement

## Description Add a pre-commit-config YAML for the pre-commit message and fix its typos accordingly. There are two open points. ## Motivation and Context Automatically check codespell in pre-commit hooks, i.e....

PR template not filled