Bryan comments

Results 11 comments of


                                            Bryan

Only squeeze when appropriate

Any reason why the new PR was necessary? It seems to be the exact same code as the original with the comment deleted and the linter is still complaining here...

Multiple worker support

Thanks for the reply—actually, I ended up realizing I'm looking for parallelized sampling, not parallelized consumption.

It's unclear to me how parallel sampling is implemented here without vectorized environments, i.e., like https://stable-baselines.readthedocs.io/en/master/guide/vec_envs.html. Can you explain the approach?

Advice for Sample Factory use?

Also — I noticed that you created this issue in the past https://github.com/ray-project/ray/issues/5278 and I'm curious what adjustments were key in this implementation to get PPO with LSTMs to work

Advice for Sample Factory use?

Thanks for the detailed answer, appreciate it! It makes sense to me to use sample factory then. I'll give it a shot soon 👍 A few details I forgot to...

Documentation now available (questions about documentation here)

Hi, Thanks for the documentation. After reading through I'm still confused on the dynamic of batch_T / batch_size / replay_ratio. In specific, I'm trying to recreate the training loop [here](https://github.com/hill-a/stable-baselines/blob/master/stable_baselines/td3/td3.py#L276):...

Bryan

Only squeeze when appropriate

Multiple worker support

Multiple worker support

Multiple worker support

Advice for Sample Factory use?

Advice for Sample Factory use?

Documentation now available (questions about documentation here)

add Hindsight Experience Replay to replay buffers

Scroll settings are not honored in Native Notebook

Inference on TPU's