stable-baselines3 issues

[Question] Wrong scaled_action for continuous actions in `_sample_action()`?

7

### ❓ Question The following code samples action for an off-policy algorithm. As the comments indicate, the continuous actions obtained in line 395 should have already been scaled by tanh,...

KiwiXR

documentation

question

Added missing metrics when logging on tensorboard (#1298)

7

Co-authored-by: Riccardo Sepe Co-authored-by: Francesco Scalera Added missing metrics when logging on tensorboard (#1298) ## Description Now both the hparam_dict and the metric_dict are stored on Tensorboard ## Motivation and...

rogierz

How do I export RecurrentPPO as an onnx model？

3

### ❓ Question How do I export RecurrentPPO as an onnx model？ ### Checklist - [X] I have checked that there is no similar [issue](https://github.com/DLR-RM/stable-baselines3/issues) in the repo - [X]...

shuo-Liu

help wanted

question

Add `EnvPoolAdapter` in EnvPool section of the documentation

## Description After https://github.com/DLR-RM/rl-baselines3-zoo/pull/355#issuecomment-1425749593 ## Motivation and Context - [ ] I have raised an issue to propose this change ([required](https://github.com/DLR-RM/stable-baselines3/blob/master/CONTRIBUTING.md) for new features and bug fixes) ## Types of...

qgallouedec

Added a next_observations field to RolloutBufferSamples (closes #1328)

1

## Description - Added next_observations field and has_next_observation mask to type_aliases.py - Extended generators in buffers.py to return next_observation and has_next_observation - Added a test in test_buffers.py ## Motivation and...

euanong

duplicate

[Feature Request] Can PPO support graph style spaces?

4

### 🚀 Feature Support graph style data structure as the observation and action spaces for RL algorithms like PPO or others. ### Motivation After [version 0.25.0](https://github.com/openai/gym/releases/tag/0.25.0), [gym](https://github.com/openai/gym) has support [graph](https://www.gymlibrary.dev/api/spaces/#graph)...

BlueBug12

enhancement

what is the proper way to train model with model loading

8

### ❓ Question I want to train my environment on multiple volumes for that i am using a for loop ,and changing the image in the environment ``` from stable_baselines3...

muk465

question

[Feature Request] Store next observations and dones in RolloutBuffer

1

### 🚀 Feature Add `next_observations` and `dones` fields to the `RolloutBuffer` and the `DictRolloutBuffer` classes, similar to how it is done in the `ReplayBuffer` class. ### Motivation Currently, on-policy algorithms...

taufeeque9

enhancement

Add next_observations and dones to RolloutBuffer

1

## Description This PR adds the `next_observations` and `dones` fields to the `RolloutBuffer` and the `DictRolloutBuffer` classes. The `OnPolicyAlgorithm` class is also changed to store both these fields. Closes #1273....

taufeeque9

[Feature Request] Env checker for VecEnv

2

### 🚀 Feature Check the environment when creating a `VecEnv` ### Motivation I noticed that [`check_env`](https://github.com/DLR-RM/stable-baselines3/blob/2bb8ef5e632a0e0dda291c2cd6735da75a4fcb7e/stable_baselines3/common/env_checker.py#L319) doesn't work with `VecEnv`'s (#653), but I think it would be a good idea...

AlexPasqua

enhancement

stable-baselines3
stable-baselines3 copied to clipboard

Metadata

[Question] Wrong scaled_action for continuous actions in `_sample_action()`?

Added missing metrics when logging on tensorboard (#1298)

How do I export RecurrentPPO as an onnx model？

Add `EnvPoolAdapter` in EnvPool section of the documentation

Added a next_observations field to RolloutBufferSamples (closes #1328)

[Feature Request] Can PPO support graph style spaces?

what is the proper way to train model with model loading

[Feature Request] Store next observations and dones in RolloutBuffer

Add next_observations and dones to RolloutBuffer

[Feature Request] Env checker for VecEnv

← Metadata

Owner

Metadata

stable-baselines3 stable-baselines3 copied to clipboard

Metadata

← Metadata

Owner

Metadata

stable-baselines3
stable-baselines3 copied to clipboard