Antonin RAFFIN comments

Results: 880 comments of Antonin RAFFIN

[feature-request] N-step returns for TD methods

As an update, I have an experimental version of SAC + Peng Q-Lambda in the contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/tree/feat/peng-q-lambda I'm using an adapted version of the HER replay buffer (storing things by...
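For readers unfamiliar with the feature being requested, here is a minimal sketch of a plain n-step TD target. It is not the code from the contrib branch and it does not implement Peng's Q(λ); the function name `n_step_target` and its arguments are hypothetical, chosen only for illustration.

```python
from typing import Sequence


def n_step_target(
    rewards: Sequence[float],
    dones: Sequence[bool],
    bootstrap_value: float,
    gamma: float = 0.99,
) -> float:
    """Discounted n-step TD target for the first transition of a window.

    `rewards` and `dones` cover n consecutive steps; `bootstrap_value` is an
    estimate of the value of the state reached after the last step.
    """
    target = 0.0
    discount = 1.0
    for reward, done in zip(rewards, dones):
        target += discount * reward
        discount *= gamma
        if done:
            # The episode ended inside the window: do not bootstrap.
            return target
    # No termination: add the discounted value estimate of the final state.
    return target + discount * bootstrap_value
```

With `n = 1` this reduces to the usual one-step target `r + gamma * V(s')`.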

[Bug] HER is not updating the done flag of HER transitions

Hello,
> The SB3 implementation calculates a new reward but not a new done flag.

I agree, but
> This could be achieved by adding

this is unfortunately not a...
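To make the reported behaviour concrete, here is a toy sketch of hindsight relabeling in which the reward is recomputed for the new goal while the done flag is copied as-is. This is not the SB3 implementation: `Transition`, `sparse_reward` and `relabel` are hypothetical names, and the sparse reward with a distance tolerance is an assumption.

```python
from dataclasses import dataclass, replace
from typing import Tuple

Goal = Tuple[float, float]


@dataclass(frozen=True)
class Transition:
    achieved_goal: Goal
    desired_goal: Goal
    reward: float
    done: bool


def sparse_reward(achieved: Goal, desired: Goal, tol: float = 0.05) -> float:
    """Hypothetical sparse reward: 0 within `tol` of the goal, -1 otherwise."""
    dist = sum((a - d) ** 2 for a, d in zip(achieved, desired)) ** 0.5
    return 0.0 if dist <= tol else -1.0


def relabel(transition: Transition, new_goal: Goal) -> Transition:
    """Swap in `new_goal` and recompute the reward; `done` is left unchanged."""
    new_reward = sparse_reward(transition.achieved_goal, new_goal)
    return replace(transition, desired_goal=new_goal, reward=new_reward)
```

In this sketch a relabeled transition can reach its new goal (reward 0) and still carry `done=False`, which is the mismatch the issue describes.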

Also log hyperparameters to the tensorboard

Hello, you forgot to fill in the "alternatives" section ;) As an alternative, you can easily define a callback since https://github.com/DLR-RM/stable-baselines3/issues/286 was merged. It is also in the documentation: https://stable-baselines3.readthedocs.io/en/master/guide/tensorboard.html#directly-accessing-the-summary-writer. >...
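As a rough sketch of the library-independent part of such a callback, the helper below gathers hyperparameters from a model object into a flat dict of simple values. The name `collect_hparams` and the list of attribute names are hypothetical; in an SB3 callback one could presumably call it on `self.model` from `_on_training_start` and hand the result to the summary writer described in the linked documentation.

```python
from typing import Any, Dict, Iterable

# Simple value types; anything else is stored as a string (an assumption
# made here so that the resulting dict stays flat and easy to log).
SUPPORTED_TYPES = (bool, int, float, str)


def collect_hparams(model: Any, names: Iterable[str]) -> Dict[str, Any]:
    """Gather the named attributes of `model` into a flat dict.

    Missing attributes are skipped; values of unsupported types
    (dicts, callables, arrays, ...) are replaced by their string form.
    """
    hparams: Dict[str, Any] = {"algorithm": type(model).__name__}
    for name in names:
        if not hasattr(model, name):
            continue
        value = getattr(model, name)
        hparams[name] = value if isinstance(value, SUPPORTED_TYPES) else str(value)
    return hparams
```

Keeping the collection step separate from the logging call makes it easy to test without a training run.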

Also log hyperparameters to the tensorboard

> If I understand you correctly you suggest that I register a callback myself with the hyperparameters I need?

Yes, adding callbacks is in fact included in the rl zoo (I...

Also log hyperparameters to the tensorboard

As a follow-up, you can take a look at what is done in the wandb callback, where all hyperparameters are saved: https://github.com/wandb/client/blob/master/wandb/integration/sb3/sb3.py Documentation: https://gitbook-docs.wandb.ai/guides/integrations/other/stable-baselines-3

Also log hyperparameters to the tensorboard

Related: https://github.com/hill-a/stable-baselines/issues/1128#issuecomment-1124794750 (implementation by @tim99oth99e using the tensorboard package directly and an SB3 callback)

Also log hyperparameters to the tensorboard

@tim99oth99e Please go ahead ;) (don't forget to read the contributing guide and the PR checklist)

Also log hyperparameters to the tensorboard

> Hi guys, could you tell me what is the best practice between using a Logger such as KVWriter vs. using callbacks?

Use the callbacks whenever possible. `KVWriter` is...

[Bug] in `env_checker.py`

Hello, you are right (although in practice, sb3 will probably work in that case). We would welcome a PR that solves this issue ;)

[Bug] in `env_checker.py`

> modify the test file for env_checker?

Please modify the test file for the env checker.