Antonin RAFFIN comments

Results 880 comments of


                                            Antonin RAFFIN

[feature request] LstmPolicy does not support using net_arch with feature_extraction="cnn"

>Seems like few-line change + docs for a seemingly obvious thing well, as always, I think we did not investigate that too much as MLP + Framestacking is usually both...

[question] Issue with multiple instances for DDPG-MPI from stable-baselines[mpi]

Well, you can always use SB2 DDPG without calling `mpirun` but then you will have to use only one environment. And A2C/PPO are meant to be fast whereas DDPG was...

TensorboardWriter keeps files open

> isn't added is because the original coders wanted the same writer to be accessible later, @hill-a could you comment on that?

NAN problem in PPO1 and PPO2

Maybe related: https://github.com/hill-a/stable-baselines/issues/340 (try setting the entropy coeff to zero)

NAN problem in PPO1 and PPO2

it seems your are using the same env 256 times... you should pass the env id.

ACKTR hangs on atari and works very slow on custom env

Hello, Probably a duplicate of https://github.com/hill-a/stable-baselines/issues/196 Which OS are you using? I would recommend you to use PPO2 (or even [Stable-Baselines3](https://github.com/DLR-RM/stable-baselines3) PPO) as it also supports multiprocessing and usually give...

Antonin RAFFIN

[feature request] LstmPolicy does not support using net_arch with feature_extraction="cnn"

[question] Issue with multiple instances for DDPG-MPI from stable-baselines[mpi]

TensorboardWriter keeps files open

NAN problem in PPO1 and PPO2

NAN problem in PPO1 and PPO2

ACKTR hangs on atari and works very slow on custom env

[bug] PPO2 episode reward summaries are written incorrectly for VecEnvs

[bug] PPO2 episode reward summaries are written incorrectly for VecEnvs

[bug] PPO2 episode reward summaries are written incorrectly for VecEnvs

[bug] PPO2 episode reward summaries are written incorrectly for VecEnvs