
Q-learning and SARSA algorithms from Sutton and Barto's Reinforcement Learning book.

Cliff Walking Exercise: Sutton's Reinforcement Learning :robot:

My implementation of Q-learning and SARSA algorithms for a simple grid-world environment.

The code includes utility functions for visualizing reward convergence, the paths taken by the SARSA and Q-learning agents, and heat maps of each agent's action-value function.
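For orientation, here is a minimal sketch of the two tabular update rules, not the code in cliff_walking.py. The function names, the `Q` array layout (states by actions), and the default `alpha` and `gamma` values are assumptions made for illustration. The only difference between the two rules is the bootstrap target: SARSA uses the action the agent actually takes next, while Q-learning uses the greedy action in the next state.

```python
import numpy as np


def epsilon_greedy(Q, state, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return int(rng.integers(Q.shape[1]))
    return int(np.argmax(Q[state]))


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.5, gamma=1.0):
    """On-policy update: bootstrap from the action actually taken next."""
    target = r + gamma * Q[s_next, a_next]
    Q[s, a] += alpha * (target - Q[s, a])


def q_learning_update(Q, s, a, r, s_next, alpha=0.5, gamma=1.0):
    """Off-policy update: bootstrap from the greedy action in s_next."""
    target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (target - Q[s, a])


# Hypothetical usage: 3 states, 2 actions, one transition from state 0.
# Rows for terminal states are never updated, so they stay at zero.
Q = np.zeros((3, 2))
Q[1] = [2.0, 4.0]
sarsa_update(Q, s=0, a=0, r=-1.0, s_next=1, a_next=0)
q_learning_update(Q, s=0, a=1, r=-1.0, s_next=1)
```

With an exploring policy, SARSA's target includes the cost of occasional random steps off the cliff, which is why it tends to learn the safer path in this environment while Q-learning learns the shorter one along the edge.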

Contents: :star:

  • cliff_walking.py: Q-learning, SARSA, Visualization Functions
  • cliff_walking_report.pdf: Analysis of the Q-learning and SARSA algorithms

References: