Antonin RAFFIN
> At this point, wouldn't it be clearer to put the code into common/buffers.py? Yes, probably, but the most important thing for now is to test the implementation (performance test,...
> performance test, check we can reproduce the results from the paper After some initial tests on Breakout, following the hyperparameters from the paper, the run neither improved nor worsened DQN...
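For anyone who wants a reference point while testing, here is a minimal numpy-only sketch of the proportional prioritization logic from Schaul et al. (priorities only, no transition storage; the class name and interface are hypothetical and this is not the code from this PR):

```python
import numpy as np


class SimplePrioritizedSampler:
    """Minimal proportional PER sketch: p_i = (|td_error_i| + eps) ** alpha,
    importance weights w_i = (N * P(i)) ** -beta, normalized by their max."""

    def __init__(self, buffer_size: int, alpha: float = 0.6, beta: float = 0.4, eps: float = 1e-6):
        self.buffer_size = buffer_size
        self.alpha, self.beta, self.eps = alpha, beta, eps
        self.priorities = np.zeros(buffer_size, dtype=np.float64)
        self.pos = 0
        self.full = False

    def on_add(self) -> None:
        # New transitions get the current max priority so they are sampled at least once
        max_prio = self.priorities.max() if (self.full or self.pos > 0) else 1.0
        self.priorities[self.pos] = max_prio
        self.pos = (self.pos + 1) % self.buffer_size
        self.full = self.full or self.pos == 0

    def sample_indices(self, batch_size: int):
        upper = self.buffer_size if self.full else self.pos
        probs = self.priorities[:upper] ** self.alpha
        probs /= probs.sum()
        indices = np.random.choice(upper, size=batch_size, p=probs)
        weights = (upper * probs[indices]) ** (-self.beta)
        weights /= weights.max()
        return indices, weights

    def update_priorities(self, indices, td_errors) -> None:
        self.priorities[indices] = np.abs(td_errors) + self.eps
```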
An update from my side: I just added CNN support to SBX (SB3 + Jax) DQN, and it is 10x faster than the PyTorch equivalent: https://github.com/araffin/sbx/pull/49 That should allow us to...
Some additional update: when trying to plug the PER implementation from this PR into the Jax DQN implementation, the experience replay was the bottleneck (by a good margin, making things...
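For reference, a crude way to check that the replay buffer, and not the Jax update, dominates the step time could look like this (`buffer` and `jitted_update` are placeholders for the actual replay buffer and jitted training step, not SBX code):

```python
import time

import jax


def profile_step(buffer, jitted_update, batch_size: int = 32, n_iters: int = 1000) -> None:
    """Rough split of wall-clock time between replay sampling and the gradient step."""
    sample_time, update_time = 0.0, 0.0
    for _ in range(n_iters):
        t0 = time.perf_counter()
        batch = buffer.sample(batch_size)
        t1 = time.perf_counter()
        out = jitted_update(batch)
        # JAX dispatches asynchronously, so wait for the result before stopping the timer
        jax.block_until_ready(out)
        t2 = time.perf_counter()
        sample_time += t1 - t0
        update_time += t2 - t1
    print(f"sampling: {sample_time:.2f}s, update: {update_time:.2f}s over {n_iters} iterations")
```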
> Does SBX/Jax mean this much speed improvement? With the right parameters (see the exact command line arguments for the RL Zoo in the OpenRL Benchmark organization runs on W&B),...
> Add next_observations and dones fields to the RolloutBuffer and the DictRolloutBuffer classes, similar to how it is done in the ReplayBuffer class. dones are stored in `episode_starts` (shifted by...
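Concretely, `episode_starts[t]` is the done flag of step `t - 1`, so dones can be recovered by shifting; a tiny illustration with made-up arrays (not RolloutBuffer code):

```python
import numpy as np

# episode_starts[t] is True when step t begins a new episode,
# i.e. when the previous step t - 1 ended with done=True (shifted by one)
episode_starts = np.array([True, False, False, True, False])

# Recover dones for steps 0..T-2 by shifting episode_starts left by one;
# the done flag of the very last step is not contained in this array
dones = episode_starts[1:].copy()
print(dones)  # [False False  True False]
```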
Hello, > I suggest we add a log with verbose=2 that describes if preprocess_obs normalized any of the input for the network. Where exactly do you want to print additional...
> I would suggest doing it at the beginning of the training, the same way we display this kind of log: I see. It will be a bit harder in that...
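A rough sketch of what such a start-of-training message could look like (the `describe_preprocessing` helper and its arguments are hypothetical; it only relies on `is_image_space` from `stable_baselines3.common.preprocessing`):

```python
from gymnasium import spaces
from stable_baselines3.common.preprocessing import is_image_space


def describe_preprocessing(observation_space, normalize_images: bool = True, verbose: int = 0) -> None:
    """Print which observation (sub)spaces will have their pixels normalized."""
    if verbose < 2:
        return
    sub_spaces = (
        observation_space.spaces.items()
        if isinstance(observation_space, spaces.Dict)
        else [("observation", observation_space)]
    )
    for key, space in sub_spaces:
        if isinstance(space, spaces.Box) and is_image_space(space) and normalize_images:
            print(f"{key}: image space detected, pixels will be normalized to [0, 1]")
        else:
            print(f"{key}: no image normalization applied")
```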
See https://github.com/DLR-RM/stable-baselines3/issues/622
> or you're still waiting for contributions? We are welcoming contributions =) I guess adapting https://github.com/Howuhh/prioritized_experience_replay from @Howuhh would be a good contribution.
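For reference, the core of such an implementation is a sum tree, which gives O(log N) proportional sampling and priority updates; a minimal sketch (written from scratch here, not taken from the linked repository):

```python
import numpy as np


class SumTree:
    """Binary tree where each parent stores the sum of its children's priorities."""

    def __init__(self, capacity: int):
        self.capacity = capacity
        # Internal nodes in tree[:capacity - 1], leaf priorities in tree[capacity - 1:]
        self.tree = np.zeros(2 * capacity - 1, dtype=np.float64)

    def update(self, data_index: int, priority: float) -> None:
        # Update a leaf and propagate the change up to the root
        tree_index = data_index + self.capacity - 1
        change = priority - self.tree[tree_index]
        self.tree[tree_index] = priority
        while tree_index > 0:
            tree_index = (tree_index - 1) // 2
            self.tree[tree_index] += change

    def get(self, value: float) -> int:
        # Descend the tree to find the leaf whose cumulative sum contains `value`
        index = 0
        while 2 * index + 1 < len(self.tree):
            left = 2 * index + 1
            if value <= self.tree[left]:
                index = left
            else:
                value -= self.tree[left]
                index = left + 1
        return index - (self.capacity - 1)  # convert back to a data index

    @property
    def total(self) -> float:
        return self.tree[0]
```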