Antonin RAFFIN comments

Results 880 comments of


                                            Antonin RAFFIN

[Bug]: MaskablePPO Inaccurate update counting when target_kl early exists

Hello, could you do a PR to fix this issue?

[Question] Continue training

Hello, > I just wanted to make sure that I'm not missing something. Is this the right way to do it? You should also save the replay buffer (see doc...

[bug] Adaptive SAC: using logarithm of entropy coefficient to compute temperature objective instead of entropy coefficient

Duplicate of https://github.com/DLR-RM/stable-baselines3/issues/36 https://github.com/DLR-RM/stable-baselines3/issues/802 and https://github.com/DLR-RM/stable-baselines3/issues/712 PS: could you do a PR that a note about that in our doc?

[Question] fps drops significantly over time

what mujoco version do you use? SB3 is only fully compatible with 0.29.1 for now. Do you have the same issue with built-env mujoco env? (for instance `HalfCheetah-v4`) I haven't...

[Question] fps drops significantly over time

I'm doing many runs on MuJoCo envs and I cannot see the effect you describe so far: https://wandb.ai/openrlbenchmark/sbx/runs/99wrpkc7?nw=nwuseraraffin (other runs are available in https://wandb.ai/openrlbenchmark/sbx/). To reproduce, use the train script...

[Question] fps drops significantly over time

Note: when using multiple envs, you should probably adjust the `n_steps` to have a constant batch size For instance: ``` JAX_PLATFORMS=cpu CUDA_VISIBLE_DEVICES= python train.py --algo ppo \ --env HalfCheetah-v4 -P...

[Feature Request] When are you planning to upgrade to Gymnasium v1.0.0

https://github.com/DLR-RM/stable-baselines3/pull/1837#issuecomment-2402932624

[Feature Request] When are you planning to upgrade to Gymnasium v1.0.0

Support added in SB3 v2.4.0a11 (pre-release, also on master), please give it a try.

[Question] Can stable-baselines3 be installed through pip without cuda dependencies? Is the CPU only docker image the only alternative?

> Can stable-baselines3 be installed through pip without cuda dependencies? You need to install torch cpu version first: https://github.com/DLR-RM/stable-baselines3/blob/56c153f048f1035f239b77d1569b240ace83c130/.github/workflows/ci.yml#L34-L35

[Question] Manually Controlling Actions During PPO Training

Hello, this is hard to answer if you don't provide a minimal example to reproduce the behavior. `.learn()` does two things (see docs): collect data and train the model (when...