Machine-Learning-6.867-homework icon indicating copy to clipboard operation
Machine-Learning-6.867-homework copied to clipboard

policy gradient

Open manuelli opened this issue 10 years ago • 0 comments

Start thinking about policy gradient methods. What would we need to implement them? What should be the parametric form of the policy?

manuelli avatar Nov 19 '15 23:11 manuelli