Machine-Learning-6.867-homework icon indicating copy to clipboard operation
Machine-Learning-6.867-homework copied to clipboard

Results 7 Machine-Learning-6.867-homework issues
Sort by recently updated
recently updated
newest added

AI raycasting game to simulate self driving for saving people's lives https://fatalfeel.blogspot.com/2013/12/ppo-ai-on-raycast-3d-game-self-driving.html

alias ddConsoleApp=$HOME/path-to/build/bin/ddConsoleApp should be path to -> **director/build/install/bin/directorPython** all imports inside codes are need to be replaced from **ddapp** to **director**.

In the new version of `director`, it's now called `directorPython`. In addition, import statements referring to `ddapp` should be changed to `director`.

- [x] Implement abstract Q-Learning Class - [x] SARSA, Q-Learning, should all be subclasses Methods to implement - [x] SARSA, see #32 - [ ] policy search / policy gradient,...

- [x] Implement SARSA update - [x] Test it for convergence with a known policy, i.e. our simple controller - [x] Implement full SARSA policy updates

Start thinking about policy gradient methods. What would we need to implement them? What should be the parametric form of the policy?