spiglerg
Connected to Issue https://github.com/DLR-RM/stable-baselines3/issues/1953

## Description

RunningMeanStd is made robust to overflows with two modifications:

- the product that can produce overflows when `count` becomes too large is split into...
### 🐛 Bug

RunningMeanStd is not overflow-safe and overflows during large-scale training (e.g., on a cluster).

### To Reproduce

I'm submitting a pull request with a proposal to...
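
For context, here is a minimal sketch of one way a running mean/variance merge can avoid a large `count * batch_count` intermediate. It is not the code from the pull request, whose exact change is truncated above. It assumes the usual parallel-variance update and reorders it so the batch weight `batch_count / tot_count` is computed first. The function name `merge_moments` and its signature are hypothetical.

```python
import numpy as np


def merge_moments(mean, var, count, batch_mean, batch_var, batch_count):
    """Merge running (mean, var, count) with batch statistics.

    Hypothetical helper: algebraically the same as the parallel-variance
    formula, but written so no intermediate scales like count * batch_count.
    """
    delta = batch_mean - mean
    tot_count = count + batch_count
    # Fraction of all samples that belong to the new batch, always in [0, 1].
    w = batch_count / tot_count
    new_mean = mean + delta * w
    # Same as (var*count + batch_var*batch_count
    #          + delta**2 * count * batch_count / tot_count) / tot_count
    new_var = var * (1.0 - w) + batch_var * w + np.square(delta) * w * (1.0 - w)
    return new_mean, new_var, tot_count


# Usage: merge statistics of two chunks and compare with the full data.
rng = np.random.default_rng(0)
a = rng.normal(size=(1000, 3))
b = rng.normal(loc=2.0, size=(500, 3))
mean, var, count = merge_moments(
    a.mean(axis=0), a.var(axis=0), a.shape[0],
    b.mean(axis=0), b.var(axis=0), b.shape[0],
)
full = np.concatenate([a, b])
print(np.allclose(mean, full.mean(axis=0)), np.allclose(var, full.var(axis=0)))
```

Because `w` and `1 - w` stay between 0 and 1, the size of each term is bounded by the variances and `delta**2`, not by the sample counts.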
# Description

The current code for BaseMujocoEnv requires the env metadata dictionary to have fixed, pre-specified values. While this may be useful for future API changes, it doesn't seem very...

### Proposal

The current code for BaseMujocoEnv requires the env metadata dictionary to have fixed, pre-specified values. While this may be useful for future API changes, it doesn't seem very...