IMPALA-Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures
IMPALA-Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures copied to clipboard

Published 20 hours ago •

→

Metadata

Readme
Issues

Implementation of Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures

These results are from only 4 threads. So unstable to train.
Tensorflow Implementation
A3C type thread environment training method
PongDeterministic-v4 environment

Todo

[x] Only CPU Training method
[ ] Use Network protocol method
[ ] Training on GPU, Inference on CPU

Reference

← Metadata

34

Stars

3

Forks

Watchers

Owner

Metadata