IMPALA-Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures
IMPALA-Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures copied to clipboard
Implementation of Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures
- These results are from only 4 threads. So unstable to train.
- Tensorflow Implementation
- A3C type thread environment training method
- PongDeterministic-v4 environment
Todo
- [x] Only CPU Training method
- [ ] Use Network protocol method
- [ ] Training on GPU, Inference on CPU