ppo_cpp parallel running

parallel running

Open waynezw0618 opened this issue 5 years ago • 1 comments

Hi Antymon: Your PPO code is interesting. I am wondering whether it can be used for parallel training, where each episode is a mpi/openmp solver. I would like to perform simulations on a cluster with each episode as a single node openmp based simulation. do you have any suggestion?

Yours Sincerely

Wei

Feb 12 '20 15:02 waynezw0618

Hi, this implementation is meant for a single multicore node and even with that respect is rather simple - with multiple environments being called in parallel on each step and centralized network processing resumed once all environments delivered results. TF's graph portability will be useful for you, of course, and perhaps some snippets of my code. I vaguely recall that OpenAI Baselines had MPI implementations in Python that you can look into to port or reuse (there were 2 implementations of PPO initially PPO and PPO2... ). Hope that helps.

Feb 15 '20 14:02 Antymon

ppo_cpp ppo_cpp copied to clipboard

parallel running

ppo_cpp
ppo_cpp copied to clipboard