Antonin RAFFIN comments

Results 880 comments of


                                            Antonin RAFFIN

[Feature Request] independently configurable learning rates for actor and critic

>i cannot reproduce the work of other publications as most of the literature does exactly that. TD3 is nice, but using an different algorithm is less comparable. TD3 is literally...

[Feature Request] independently configurable learning rates for actor and critic

Re-opening as it might be helpful to be able to tune `qf_learning_rate` as done in https://github.com/araffin/sbx. We need to see how much complexity it adds though.

[Feature Request] independently configurable learning rates for actor and critic

> can I not do this because they share the optimizer? yes > If so, can I change the optimizer so they can or would you not recommend this? you...

[Feature Request] independently configurable learning rates for actor and critic

> Is this feature request still under consideration yes, but help from contributors is needed. Otherwise, you can have a look at SBX: https://github.com/araffin/sbx > Is this the only difference...

[Bug]: Missing metrics when logging hyperparameters on tensorboard

@timothe-chaumont as you did the implementation in https://github.com/DLR-RM/stable-baselines3/pull/984, could you have a look? > `new_logger.dump()` I would expect `dump(num_timesteps)` there

[Question] I do not understand the GPU and memory usage of SB3

Hello, there is an important information missing which is your network architecture. The rollout buffer store things in the RAM not on the GPU. And most GPU memory is taken...

[Question] I do not understand the GPU and memory usage of SB3

This should answer your question: https://github.com/DLR-RM/stable-baselines3/blob/aab545901fe331814f822060d677f22191cba419/stable_baselines3/common/buffers.py#L391-L398

[Question] I do not understand the GPU and memory usage of SB3

https://github.com/DLR-RM/stable-baselines3/pull/1720#issuecomment-1776700957

[Question] I do not understand the GPU and memory usage of SB3

> the number of used environments improves the GPU usage? are you using subprocesses? if so, that might be due to the way python multiprocessing work. > This should mean...

Prioritized experience replay

> Thanks @araffin ! Out of curiosity, may I ask why the switch between torch and numpy for the backend? to be consistent with the rest of the buffers and...