Machine-Learning-6.867-homework SARSA

SARSA

Open manuelli opened this issue 9 years ago • 1 comments

[x] Implement SARSA update
[x] Test it for convergence with a known policy, i.e. our simple controller
[x] Implement full SARSA policy updates

Nov 19 '15 22:11 manuelli

Currently the discrete version of this is working. 4 inner and 4 outer bins seems to work reasonably well. Have implemented the continuous version with function approximation. Now need to test it our for convergence.

Dec 01 '15 18:12 manuelli

Machine-Learning-6.867-homework Machine-Learning-6.867-homework copied to clipboard

SARSA

Machine-Learning-6.867-homework
Machine-Learning-6.867-homework copied to clipboard