intuitive_policy_gradient
intuitive_policy_gradient copied to clipboard

Published 20 hours ago •

Reame
Issues

calculating the gradient

Open mtorabirad opened this issue 4 years ago • 0 comments

Thanks for the nice tutorial. Could you comment on where the line "slid.grad = slid.value * (1-slid.value)" is coming from?

Dec 05 '20 14:12 mtorabirad