Antonin RAFFIN
Hello, I will try to have a look, probably related to https://github.com/DLR-RM/stable-baselines3/issues/834, see my comment for potential solutions: https://github.com/DLR-RM/stable-baselines3/issues/834#issuecomment-1077642829
> have done the math for the space taken by the batch of observations in Float32: ((101*51) + (9) + (1) +(1)) * 4bytes * 10000 = 0.2 GB of...
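The arithmetic in the quote can be checked quickly; the observation shapes come from the quote itself, the rest is standard float32 sizing (4 bytes per value):

```python
# Per-transition observation values: a (101, 51) array plus a 9-vector and two scalars
obs_values = 101 * 51 + 9 + 1 + 1          # 5162 float32 values per observation
bytes_total = obs_values * 4 * 10_000      # 4 bytes per float32, batch of 10 000
gb = bytes_total / 1e9
print(f"{bytes_total} bytes ≈ {gb:.1f} GB")  # ≈ 0.2 GB, as stated
```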
> I guess I could only normalize the rewards and not the observations? Yes, and you can exclude specific observation keys too (recommended here). Could you please give the output...
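Normalizing rewards only corresponds to `VecNormalize(env, norm_obs=False, norm_reward=True)` in SB3 (and `norm_obs_keys` selects which dict-observation keys get normalized). Under the hood this amounts to dividing rewards by a running standard deviation; a self-contained sketch of the running statistics (an illustration only, not SB3's exact implementation, which tracks the variance of discounted returns):

```python
import numpy as np

class RunningMeanStd:
    """Running mean/variance via a parallel-variance update (Welford-style)."""
    def __init__(self):
        self.mean, self.var, self.count = 0.0, 1.0, 1e-4

    def update(self, x):
        batch_mean, batch_var, batch_count = np.mean(x), np.var(x), len(x)
        delta = batch_mean - self.mean
        tot = self.count + batch_count
        self.mean += delta * batch_count / tot
        m_a = self.var * self.count
        m_b = batch_var * batch_count
        self.var = (m_a + m_b + delta**2 * self.count * batch_count / tot) / tot
        self.count = tot

rms = RunningMeanStd()
rewards = np.array([1.0, 2.0, 3.0, 4.0])
rms.update(rewards)
# Reward normalization: divide by the running std (epsilon for stability)
normalized = rewards / np.sqrt(rms.var + 1e-8)
```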
> have done the math for the space taken by the batch of observations in Float32: ((101*51) + (9) + (1) +(1)) * 4bytes * 10000 = 0.2 GB of...
should be closed by #842
Hello, that would be a valuable extension to SB3 but should be done in the [RL Zoo](https://github.com/DLR-RM/rl-baselines3-zoo) I think (or in an external repo). > Here is how I would...
> Making it some contrib to RL Zoo indeed looks like the way to go, since the export can be achieved "from the outside" of SB3. Feel free to open...
Thanks for the PR =) > The export is initiated through enjoy.py, since I didn't want to duplicate or factor out the environment loading logic; to start the export pass...
> , dropping the dependency with CMRC or making it optional Fewer dependencies are usually better ;) > Enabling the Python binding is more like a test than a real...
For the reason why, you can read the migration guide: https://stable-baselines3.readthedocs.io/en/master/guide/migration.html Also, each algorithm page explains which spaces and policies are supported, for instance, for PPO: https://stable-baselines3.readthedocs.io/en/master/modules/ppo.html
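The per-algorithm pages linked above each carry a table of supported action spaces. As a quick cross-reference, the mapping below reflects those docs for the standard algorithms; the lookup helper itself is hypothetical, just for illustration:

```python
# Supported *action* space types per SB3 algorithm, mirroring the docs tables
SUPPORTED_ACTION_SPACES = {
    "PPO":  {"Box", "Discrete", "MultiDiscrete", "MultiBinary"},
    "A2C":  {"Box", "Discrete", "MultiDiscrete", "MultiBinary"},
    "DQN":  {"Discrete"},
    "SAC":  {"Box"},
    "TD3":  {"Box"},
    "DDPG": {"Box"},
}

def algorithms_for(space_type: str) -> list:
    """Hypothetical helper: which algorithms support a given action space type."""
    return sorted(a for a, spaces in SUPPORTED_ACTION_SPACES.items()
                  if space_type in spaces)

print(algorithms_for("Discrete"))  # ['A2C', 'DQN', 'PPO']
```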