Machine-Learning-6.867-homework
Machine-Learning-6.867-homework copied to clipboard
SARSA
- [x] Implement SARSA update
- [x] Test it for convergence with a known policy, i.e. our simple controller
- [x] Implement full SARSA policy updates
Currently the discrete version of this is working. 4 inner and 4 outer bins seems to work reasonably well. Have implemented the continuous version with function approximation. Now need to test it our for convergence.