Hongzi Mao
Results: 5 repositories owned by Hongzi Mao
a3c
26 stars, 14 forks
TensorFlow implementation of Asynchronous Advantage Actor-Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
deeprm
272 stars, 146 forks
Resource Management with Deep Reinforcement Learning (HotNets '16)
input_driven_rl_example
28 stars, 10 forks
Variance Reduction for Reinforcement Learning in Input-Driven Environments (ICLR '19)