IMPALA-Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures icon indicating copy to clipboard operation
IMPALA-Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures copied to clipboard

Implementation of Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures

  • These results are from only 4 threads. So unstable to train.
  • Tensorflow Implementation
  • A3C type thread environment training method
  • PongDeterministic-v4 environment

Todo

  • [x] Only CPU Training method
  • [ ] Use Network protocol method
  • [ ] Training on GPU, Inference on CPU

Reference