RL-intro icon indicating copy to clipboard operation
RL-intro copied to clipboard

RL Intro

2019 edition - Gym intro, Genetics, CEM, Tabular DQN

0. Gym interface

  • 00-gym.ipynb Open In Colab

1. Genetic algorithm

  • slides
  • 01-genetics.ipynb Open In Colab
Additional materials
  • [recommended] - awesome openai post about evolution strategies - blog post, article
  • Video on genetic algorithms - https://www.youtube.com/watch?v=ejxfTy4lI6I
  • Another guide to genetic algorithm - https://www.youtube.com/watch?v=zwYV11a__HQ
  • PDF on Differential evolution - http://jvanderw.une.edu.au/DE_1.pdf
  • Video on Ant Colony Algorithm - https://www.youtube.com/watch?v=D58nLNLkb0I
  • Longer video on Ant Colony Algorithm - https://www.youtube.com/watch?v=xpyKmjJuqhk

2. Cross Entropy Method

  • slides
  • 02-cem.ipynb Open In Colab
Additional materials
  • [main] Video-intro by David Silver - https://www.youtube.com/watch?v=2pWv7GOvuf0
  • Optional lecture by David Silver - https://www.youtube.com/watch?v=lfHX2hHRMVQ
  • [recommended] - formal explanation of crossentropy method in general and for optimization

3. Tabular

  • slides
  • 03-tabular.ipynb Open In Colab
Additional materials
  • [main] lecture by David Silver - url
  • Alternative lecture by Pieter Abbeel: part 1, part 2
  • Alternative lecture by John Schulmann: https://www.youtube.com/watch?v=IL3gVyJMmhg
  • Definitive guide in policy/value iteration from Sutton: start from page 81 here.

4. DQN

  • slides
  • 04-dqn.ipynb Open In Colab
Additional materials
  • Lecture by David Silver - video part I, video part II
  • Alternative lecture by Pieter Abbeel - video
  • Alternative lecture by John Schulmann - video
  • Blog post on q-learning Vs SARSA - url
  • N-step temporal difference from Sutton's book - suttonbook chapter 7
  • Eligibility traces from Sutton's book - suttonbook chapter 12
  • Blog post on eligibility traces - url
2020 edition - Deep RL, DQN, DDPG

Credits