actor-critic-public
Experiment details?
For benchmarking's sake, how long did the models in the paper take to train, and what type of GPUs were used and how many?