Machine-Learning-6.867-homework
AI raycasting game that simulates self-driving, with the aim of saving lives: https://fatalfeel.blogspot.com/2013/12/ppo-ai-on-raycast-3d-game-self-driving.html
The alias `alias ddConsoleApp=$HOME/path-to/build/bin/ddConsoleApp` should point to **director/build/install/bin/directorPython** instead, and all imports in the code need to be changed from **ddapp** to **director**.
In the new version of `director`, the `ddConsoleApp` executable is now called `directorPython`. In addition, import statements referring to `ddapp` should be changed to `director`.
- [x] Implement abstract Q-Learning Class
- [x] SARSA, Q-Learning, should all be subclasses

Methods to implement
- [x] SARSA, see #32
- [ ] policy search / policy gradient,...
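One way the class layout in this checklist could look is sketched below. This is a minimal illustration, not the repository's actual code: the names `TDLearner`, `target_value`, and `update`, the tabular `Q` dictionary, and the default `alpha` and `gamma` values are all assumptions made for the example. The shared update lives in the base class, and the subclasses differ only in how they bootstrap from the next state.

```python
from abc import ABC, abstractmethod
from collections import defaultdict


class TDLearner(ABC):
    """Hypothetical base class: tabular action values plus a shared TD update."""

    def __init__(self, actions, alpha=0.1, gamma=0.9):
        self.actions = list(actions)
        self.alpha = alpha            # learning rate (assumed default)
        self.gamma = gamma            # discount factor (assumed default)
        self.Q = defaultdict(float)   # maps (state, action) -> value estimate

    @abstractmethod
    def target_value(self, next_state, next_action):
        """Return the next-state value used in the bootstrapped target."""

    def update(self, s, a, r, s_next, a_next, done=False):
        # Terminal transitions do not bootstrap from the next state.
        bootstrap = 0.0 if done else self.target_value(s_next, a_next)
        td_error = r + self.gamma * bootstrap - self.Q[(s, a)]
        self.Q[(s, a)] += self.alpha * td_error
        return td_error


class SARSA(TDLearner):
    def target_value(self, next_state, next_action):
        # On-policy: bootstrap from the action actually taken next.
        return self.Q[(next_state, next_action)]


class QLearning(TDLearner):
    def target_value(self, next_state, next_action):
        # Off-policy: bootstrap from the greedy action; next_action is ignored.
        return max(self.Q[(next_state, a)] for a in self.actions)
```

Keeping the update in the base class means a later learner only has to override `target_value`, provided it fits the same one-step TD form.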
SARSA
- [x] Implement SARSA update
- [x] Test it for convergence with a known policy, i.e. our simple controller
- [x] Implement full SARSA policy updates
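The convergence check in this list can be illustrated on a toy problem. The sketch below is an assumption-laden stand-in, not the project's simulator or controller: it uses a hypothetical deterministic chain where a fixed policy always moves right and the only reward is 1.0 on reaching the end. Because the true action values are known in closed form (`gamma` raised to the number of remaining steps minus one), the SARSA estimates can be compared against them directly.

```python
RIGHT = 0  # the only action the fixed policy ever selects


def fixed_policy(state):
    """Stand-in for a known controller: always move right."""
    return RIGHT


def sarsa_evaluate_chain(n_states=3, episodes=200, alpha=0.5, gamma=0.9):
    """Run SARSA updates under fixed_policy on a deterministic chain."""
    Q = {(s, RIGHT): 0.0 for s in range(n_states)}
    for _ in range(episodes):
        s, a = 0, fixed_policy(0)
        while True:
            s_next = s + 1
            done = s_next == n_states          # stepping past the last state ends the episode
            r = 1.0 if done else 0.0
            if done:
                Q[(s, a)] += alpha * (r - Q[(s, a)])
                break
            a_next = fixed_policy(s_next)      # on-policy: next action comes from the same policy
            Q[(s, a)] += alpha * (r + gamma * Q[(s_next, a_next)] - Q[(s, a)])
            s, a = s_next, a_next
    return Q


def true_values(n_states=3, gamma=0.9):
    """Closed-form action values of the fixed policy on the chain."""
    return {(s, RIGHT): gamma ** (n_states - 1 - s) for s in range(n_states)}


def max_abs_error(n_states=3, episodes=200, alpha=0.5, gamma=0.9):
    Q = sarsa_evaluate_chain(n_states, episodes, alpha, gamma)
    truth = true_values(n_states, gamma)
    return max(abs(Q[k] - truth[k]) for k in truth)
```

Calling `max_abs_error()` gives a single number to assert against a tolerance. In a stochastic environment the same check would need a decaying learning rate and a looser tolerance.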
Start thinking about policy gradient methods. What would we need to implement them? What should be the parametric form of the policy?
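As one possible answer to the question of parametric form, a common choice for discrete actions is a softmax over linear action preferences. The sketch below assumes that form along with a REINFORCE-style update; the function names, the per-action weight rows in `theta`, and the feature vector are hypothetical, and nothing here reflects a decision the project has made. It shows the two pieces such a method needs: action probabilities and the gradient of the log-probability of the chosen action.

```python
import math


def softmax_policy(theta, features):
    """Action probabilities from linear preferences theta[a] . features."""
    prefs = [sum(w * x for w, x in zip(row, features)) for row in theta]
    m = max(prefs)                              # subtract the max for numerical stability
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def grad_log_policy(theta, features, action):
    """Gradient of log pi(action | features) with respect to theta."""
    probs = softmax_policy(theta, features)
    # For softmax-linear policies: (indicator(b == action) - pi(b)) * features
    return [
        [((1.0 if b == action else 0.0) - probs[b]) * x for x in features]
        for b in range(len(theta))
    ]


def reinforce_step(theta, features, action, ret, lr=0.1):
    """One REINFORCE-style update: theta += lr * return * grad log pi."""
    grad = grad_log_policy(theta, features, action)
    return [
        [w + lr * ret * g for w, g in zip(row, grad_row)]
        for row, grad_row in zip(theta, grad)
    ]
```

A positive return raises the probability of the action that was taken, and a negative return lowers it. For continuous controls, a Gaussian policy with a parameterized mean would be the analogous starting point.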