Antonin RAFFIN comments

Results 880 comments of


                                            Antonin RAFFIN

[question] EvalCallback using MPI

> @araffin has anything changed with regards to SB3 supporting the MPI or its still not supported? It is not (https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/issues/11, https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/issues/45), but contribution is welcomed ;) But with SB3,...

[Question] How best to implement self-play/multiple agents in the same environment?

Hello, I think @AdamGleave tackled that problem in the [Adversarial policies](https://github.com/HumanCompatibleAI/adversarial-policies) repo, you should take a look ;)

Add minimal TF2 support

I would rather keep a separate branch for minimal tf2 support, as this requires tf>=1.15. Regarding tf-contrib, all the issues are: https://github.com/Stable-Baselines-Team/stable-baselines/search?q=contrib I'm also afraid of breaking previously saved models....

Add minimal TF2 support

> but we could inform users somehow that they should install the branch version yes, in the doc and readme. For the `setup.py`, there is already a version limit there....

Add minimal TF2 support

> any progress on this? this should answer your question: https://github.com/hill-a/stable-baselines/issues/366 SB3 repo (pytorch): https://github.com/DLR-RM/stable-baselines3 SBX (jax, experimental): https://github.com/araffin/sbx SB2 tf2 (unnofficial): https://github.com/sophiagu/stable-baselines-tf2 SB tf2 (experimental, archive): https://github.com/Stable-Baselines-Team/stable-baselines-tf2/

Antonin RAFFIN

[question] EvalCallback using MPI

[Question] How best to implement self-play/multiple agents in the same environment?

Add minimal TF2 support

Add minimal TF2 support

Add minimal TF2 support

Pre-Training Problem

Possible to run a full episode and collate results? For training on real-time hardware.

Possible to run a full episode and collate results? For training on real-time hardware.

[question] SAC target q nets may be updated many times in each round?

[question] SAC target q nets may be updated many times in each round?