Xiaohu Zhu

Results 8 issues of Xiaohu Zhu

I think DDPG could be added; this algorithm performs better in continuous action spaces. Looking forward to it. :)

https://arxiv.org/pdf/1703.01703.pdf

enhancement
prio:low

FeUdal Networks for Hierarchical Reinforcement Learning https://arxiv.org/pdf/1703.01161.pdf

enhancement
prio:low

Neural Fictitious Self Play https://arxiv.org/abs/1603.01121

enhancement
prio:low

Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep RL https://arxiv.org/pdf/1706.00387.pdf

enhancement
prio:low

The linear algebra part is finished.

reviewer wanted

The probability part is finished.

reviewer wanted