Antonin RAFFIN
Antonin RAFFIN
As an update, I have an experimental version of SAC + Peng Q-Lambda in the contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/tree/feat/peng-q-lambda I'm using an adapted version of the HER replay buffer (storing things by...
Hello, > he SB3 implementation calculates a new reward but not a new done flag. I agree but > This could be achieved by adding this is unfortunately not a...
Hello, you forgot to fill-in the "alternatives" section ;) As an alternative, you can easily define a callback since https://github.com/DLR-RM/stable-baselines3/issues/286 was merged. It is also in the documentation: https://stable-baselines3.readthedocs.io/en/master/guide/tensorboard.html#directly-accessing-the-summary-writer. >...
>If I understand you correctly you suggest that I register a callback myself with the hyperparameters I need? Yes, adding callbacks is in fact included in the rl zoo (I...
As a follow-up, you can take a look at what is done in the wandb callback: https://github.com/wandb/client/blob/master/wandb/integration/sb3/sb3.py (where all hyperparameters are saved) Documentation: https://gitbook-docs.wandb.ai/guides/integrations/other/stable-baselines-3
Related https://github.com/hill-a/stable-baselines/issues/1128#issuecomment-1124794750 (implementation by @tim99oth99e using tensorboard package directly and SB3 callback)
@tim99oth99e Please go ahead ;) (don't forget to read the contributing guide and the PR checklist)
> Hi guys, could you tell me what is the best practice between using a Logger such as KVWriter vs. using callbacks ? Use the callbacks whenever possible. `KVWriter` is...
Hello, you are right (although in practice, sb3 will probably work in that case). We would welcome a PR that solves this issue ;)
> modify the test file for env_checker? Please modify the test file for the env checker.