Reinforcement-learning-with-tensorflow
Reinforcement-learning-with-tensorflow copied to clipboard
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Reinforcement Learning Methods and Tutorials
In these tutorials for reinforcement learning, it covers from the basic RL algorithms to advanced algorithms developed recent years.
If you speak Chinese, visit 莫烦 Python or my Youtube channel for more.
As many requests about making these tutorials available in English, please find them in this playlist: (https://www.youtube.com/playlist?list=PLXO45tsB95cIplu-fLMpUEEZTwrDNh6Ba)
Table of Contents
- Tutorials
- Simple entry example
- Q-learning
- Sarsa
- Sarsa(lambda)
- Deep Q Network (DQN)
- Using OpenAI Gym
- Double DQN
- DQN with Prioitized Experience Replay
- Dueling DQN
- Policy Gradients
- Actor-Critic
- Deep Deterministic Policy Gradient (DDPG)
- A3C
- Dyna-Q
- Proximal Policy Optimization (PPO)
- Curiosity Model, Random Network Distillation (RND)
-
Some of my experiments
- 2D Car
- Robot arm
- BipedalWalker
- LunarLander
Some RL Networks
Deep Q Network
data:image/s3,"s3://crabby-images/77411/774110d80a664ba3914574cef6f8708a9579bbdb" alt=""
Double DQN
data:image/s3,"s3://crabby-images/800c1/800c1c3e08f704b77c7abf32b4b5331483c7d78e" alt=""
Dueling DQN
data:image/s3,"s3://crabby-images/2058b/2058b8adbbab686405d3d4bf9dd0e9275c2f0b4b" alt=""
Actor Critic
data:image/s3,"s3://crabby-images/c1818/c18186eeacdf28c323784c5fe7d3c1fbe774eb64" alt=""
Deep Deterministic Policy Gradient
data:image/s3,"s3://crabby-images/8ae91/8ae91de1a4727d96dbfe477fef92101c0869904d" alt=""
A3C
data:image/s3,"s3://crabby-images/1dbd1/1dbd14dac0fada4741b1b25d076633f55e76573b" alt=""
Proximal Policy Optimization (PPO)
data:image/s3,"s3://crabby-images/3a922/3a92244c0399ae4db7e27e1e9dec9dad65e6023a" alt=""
Curiosity Model
data:image/s3,"s3://crabby-images/1044f/1044f72e3742cf6670dbf58cc3dc99c6cb5459a1" alt=""
Donation
If this does help you, please consider donating to support me for better tutorials. Any contribution is greatly appreciated!