Cliff-Walking-Solution
Q-learning and SARSA algorithms from Sutton's Reinforcement Learning book.
Cliff Walking Exercise: Sutton's Reinforcement Learning :robot:
My implementation of the Q-learning and SARSA algorithms for a simple grid-world environment.
The code includes utility functions for visualizing reward convergence, the paths taken by the SARSA and Q-learning agents, and heat maps of each agent's action-value function.
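The two algorithms differ only in how they bootstrap from the next state. Here is a minimal sketch of that difference. It is not the code in `cliff_walking.py`, and every name in it (`step`, `epsilon_greedy`, `sarsa_update`, `q_learning_update`) is hypothetical. It assumes the standard 4x12 cliff-walking layout from the book: a reward of -1 per step, and -100 plus a reset to the start for stepping off the cliff.

```python
import random

# Assumed layout: 4 rows x 12 columns, start bottom-left, goal bottom-right.
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
ROWS, COLS = 4, 12
START, GOAL = (3, 0), (3, 11)


def step(state, action):
    """Apply an action; return (next_state, reward, done)."""
    # Moves that would leave the grid keep the agent at the border.
    r = min(max(state[0] + ACTIONS[action][0], 0), ROWS - 1)
    c = min(max(state[1] + ACTIONS[action][1], 0), COLS - 1)
    if r == ROWS - 1 and 0 < c < COLS - 1:
        # Cliff cells lie between start and goal on the bottom row.
        return START, -100.0, False
    return (r, c), -1.0, (r, c) == GOAL


def epsilon_greedy(q, state, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    values = q[state]
    return values.index(max(values))


def sarsa_update(q, s, a, reward, s_next, a_next, alpha, gamma):
    """On-policy: bootstrap from the action actually chosen in s_next."""
    target = reward + gamma * q[s_next][a_next]
    q[s][a] += alpha * (target - q[s][a])


def q_learning_update(q, s, a, reward, s_next, alpha, gamma):
    """Off-policy: bootstrap from the greedy action in s_next."""
    target = reward + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])


# Action-value table: one list of four values per grid cell, initialised to 0.
q = {(r, c): [0.0] * len(ACTIONS) for r in range(ROWS) for c in range(COLS)}
rng = random.Random(0)
first_action = epsilon_greedy(q, START, 0.1, rng)
```

Because the goal state's values are never updated, they stay at zero, so both targets reduce to the immediate reward on the final transition. With epsilon-greedy exploration, SARSA's target includes the cost of occasional exploratory steps off the cliff, which is why it tends to learn the safer path away from the edge, while Q-learning learns the shorter path along it.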
Contents: :star:
- cliff_walking.py: Q-learning, SARSA, and visualization functions
- cliff_walking_report.pdf: Analysis of the Q-learning and SARSA algorithms