chainerrl icon indicating copy to clipboard operation
chainerrl copied to clipboard

interpolated policy gradient (IPG)

Open tigerneil opened this issue 8 years ago • 0 comments

Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep RL

https://arxiv.org/pdf/1706.00387.pdf

tigerneil avatar Jun 06 '17 10:06 tigerneil