Machine-Learning-6.867-homework
Machine-Learning-6.867-homework copied to clipboard

Published 20 hours ago •

Reame
Issues

policy gradient

Open manuelli opened this issue 10 years ago • 0 comments

Start thinking about policy gradient methods. What would we need to implement them? What should be the parametric form of the policy?

Nov 19 '15 23:11 manuelli