Antonin RAFFIN comments

Results 880 comments of


                                            Antonin RAFFIN

[Feature Request] ACERAC

hello, > fine-time discretization Could you give a quick example/short explanation of what exact problem it solves that it not solved by other methods?

Add `NStepReplayBuffer` and `n_steps` arguments for off-policy algorithms

Thanks a lot @richardjozsa for reviewing =) > calculate the discount for self.n_steps as constant, Good catch, there is an issue with the discount factor used for bootstrapping (`model.gamma` is...

Add `NStepReplayBuffer` and `n_steps` arguments for off-policy algorithms

Some benchmark results. Note: n-steps doesn't always improve performance (and usually `n_steps=3` gives equal or better results) ## SAC - LunarLanderContinuous-v3 ``` python train.py --algo sac --env LunarLanderContinuous-v3 -P --n-eval-envs...

[Bug]: Is sb3_contrib/common/maskable/utils.py the cause of "WARN: env.action_masks to get variables from other wrappers is deprecated and will be removed in v1.0"?

Related to https://github.com/DLR-RM/stable-baselines3/pull/1837

[Feature Request] same random seed for every env in AsyncEval

Hello, I think you are missing an important alternative, which is also recommended: evaluating each candidate for multiple episodes to remove noise due to env stochasticity. Also, even if you...

[Bug]: `is_image_space` works poorly with Gymnasium's `FrameStackObservation`

Hello, probably related to https://github.com/DLR-RM/stable-baselines3/issues/1500. > This is precisely what sb3 does in their VecFrameStack! In general you should use `VecFrameStack()` instead yes.

[Bug]: `is_image_space` works poorly with Gymnasium's `FrameStackObservation`

> I am wondering why not setting the check to be >=3 instead of strictly equal. https://github.com/DLR-RM/stable-baselines3/blob/fa21bce04ee625c67f6ea2a7678bf46c39cd226c/stable_baselines3/common/preprocessing.py#L35 To me `(2, 3, 64, 64)` looks like a batch of images and...

Antonin RAFFIN

[Feature Request] ACERAC

Add `NStepReplayBuffer` and `n_steps` arguments for off-policy algorithms

Add `NStepReplayBuffer` and `n_steps` arguments for off-policy algorithms

[Bug]: Is sb3_contrib/common/maskable/utils.py the cause of "WARN: env.action_masks to get variables from other wrappers is deprecated and will be removed in v1.0"?

[Feature Request] same random seed for every env in AsyncEval

[Bug]: `is_image_space` works poorly with Gymnasium's `FrameStackObservation`

[Bug]: `is_image_space` works poorly with Gymnasium's `FrameStackObservation`

[Bug]: SubprocVecEnv ignores specified CUDA device and uses GPU 0

Read version for docker image from text file

Read version for docker image from text file