RL-intro
RL-intro copied to clipboard
RL Intro
2019 edition - Gym intro, Genetics, CEM, Tabular DQN
0. Gym interface
1. Genetic algorithm
Additional materials
- [recommended] - awesome openai post about evolution strategies - blog post, article
- Video on genetic algorithms - https://www.youtube.com/watch?v=ejxfTy4lI6I
- Another guide to genetic algorithm - https://www.youtube.com/watch?v=zwYV11a__HQ
- PDF on Differential evolution - http://jvanderw.une.edu.au/DE_1.pdf
- Video on Ant Colony Algorithm - https://www.youtube.com/watch?v=D58nLNLkb0I
- Longer video on Ant Colony Algorithm - https://www.youtube.com/watch?v=xpyKmjJuqhk
2. Cross Entropy Method
Additional materials
- [main] Video-intro by David Silver - https://www.youtube.com/watch?v=2pWv7GOvuf0
- Optional lecture by David Silver - https://www.youtube.com/watch?v=lfHX2hHRMVQ
- [recommended] - formal explanation of crossentropy method in general and for optimization
3. Tabular
Additional materials
- [main] lecture by David Silver - url
- Alternative lecture by Pieter Abbeel: part 1, part 2
- Alternative lecture by John Schulmann: https://www.youtube.com/watch?v=IL3gVyJMmhg
- Definitive guide in policy/value iteration from Sutton: start from page 81 here.
4. DQN
Additional materials
- Lecture by David Silver - video part I, video part II
- Alternative lecture by Pieter Abbeel - video
- Alternative lecture by John Schulmann - video
- Blog post on q-learning Vs SARSA - url
- N-step temporal difference from Sutton's book - suttonbook chapter 7
- Eligibility traces from Sutton's book - suttonbook chapter 12
- Blog post on eligibility traces - url
2020 edition - Deep RL, DQN, DDPG