Deep-RL
Deep-RL copied to clipboard
You can see a reference for Books, Articles, Courses and Educational Materials in this field. Implementation of Reinforcement Learning Algorithms and Environments. Python, OpenAI Gym, Tensorflow.
Reinforcement learning
A reference for Books, Articles, Courses and Educational Materials in this field
🗺️Roadmap . 📚Books · 📄Articles · 🎓Courses . 💻Blogs . ⚙️Projects
🐛Report Bug
.
❓Ask Question
.
👍Like the repo
.
👎Dislike the repo
Note
:white_check_mark::100: it means, I have seen and tried them, I definitely recommend them to you
🔥 If you want to get free access to the books, courses and articles, get in touch with me with social media🔥
📣 You can help improve this repo, by giving me a ⭐star⭐ and ❤️following me❤️
❗️What is Reinforcement Learning(RL)❓️
Reinforcement learning (RL) is a field of machine learning concerned with how intelligent agents ought to take actions in an environment in order to maximize the notion of cumulative reward. Reinforcement learning is one of three basic machine learning paradigms, alongside supervised learning and unsupervised learning.
You can also read below blog to understand the key concepts in Reinforcement Learning
:link: https://medium.com/intro-to-artificial-intelligence/key-concepts-in-reinforcement-learning-2af715dfbfa
:world_map:Roadmap
Warning This is only a segguestion roadmap
:books: Books
*"Reinforcement Learning: An Introduction - Second Edition - Richard S. Sutton and Andrew G. Barto"
"Deep Reinforcement Learning with Python - Second Edition - Sudharsan Ravichandiran"
"Grokking Deep Reinforcement Learning - Miguel Morales"
"PyTorch 1.x Reinforcement Learning Cookbook - Yuxi (Hayden) Liu"
"Deep Reinforcement Learning Hands-On - 2nd Edition - Maxim Lapan"
"TensorFlow 2 Reinforcement Learning Cookbook - 2nd Edition - Praveen Palanisamy"
:page_facing_up: Articles
📍Sorted by Time📍
1958 Oct: Rosenblatt, F. "The perceptron: A probabilistic model for information storage and organization in the brain.."
1983 Oct: Andrew G. Barto, et al. "Neuronlike adaptive elements that can solve difficult learning control problems."
1988 Feb (TD): Sutton, R.S. "Learning to Predict by the Methods of Temporal Differences."
1992 May: Watkins, C.J.C.H., Dayan, P. "Q-learning". Mach Learn 8, 279–292 (1992)
1994 Nov: G. A. Rummery, M. Niranjan. "On-Line Q-Learning Using Connectionist Systems"
1995 Mar: Gerald Tesauro. "Temporal Difference Learning and TD-Gammon."
2005 Oct: Riedmiller, Martin. "Neural fitted Q iteration–first experiences with a data efficient neural reinforcement learning method."
2012 Jul (ALE): Bellemare, Marc G., et al. "The Arcade Learning Environment: An Evaluation Platform for General Agents."
2013 Dec (DQN): Volodymyr Mnih, et al. "Playing Atari with Deep Reinforcement Learning."
2015 Feb (DQN): Mnih, Volodymyr, et al. "Human-level control through deep reinforcement learning."
2015 Feb: Bernhard Schölkopf. "Learning to see and act." Nature518, pages486–487 (2015)
2015 Sep (DDQN): Hado van Hasselt, et al. "Deep Reinforcement Learning with Double Q-learning."
2015 Sep (DDPG): Lillicrap, Timothy P., et al. "Continuous control with deep reinforcement learning."
2015 Nov: Ziyu Wang, et al. "Dueling Network Architectures for Deep Reinforcement Learning."
2015 Nov (PER): Schaul, Tom, et al. "Prioritized Experience Replay."
2016 Jan: David Silver, et al. "Mastering the game of Go with deep neural networks and tree search."
2016 Jun: Brockman, Greg, et al. "Openai gym." arXiv preprint arXiv:1606.01540.
2016 Feb (A3C): Volodymyr Mnih, et al. "Asynchronous Methods for Deep Reinforcement Learning."
2017 Aug (PPO): Schulman, John, et al. "Proximal policy optimization algorithms."
2017 Oct: Hessel, Matteo, et al. "Rainbow: Combining Improvements in Deep Reinforcement Learning."
2018 Dec (SAC): Haarnoja, Tuomas, et al. "Soft Actor-Critic Algorithms and Applications."
2020 Oct: Xing Wua, et al. "Adaptive stock trading strategies with deep reinforcement learning methods."
2022 Mar: Wen, Y. et al. "Improving the exploration efficiency of DQNs via the confidence bound methods."
:mortar_board: Courses
1. Reinforcement Learning Specialization
Master the Concepts of Reinforcement Learning. Implement a complete RL solution and understand how to apply AI tools to solve real-world problems.
- Offered By UNIVERSITY OF ALBERTA ALBERTA MACHINE INTELLIGENCE INSTITUTE
- There are 4 Courses in this Specialization
:link:Fundamentals of Reinforcement Learning
:link:Sample-based Learning Methods
:link:Prediction and Control with Function Approximation
:link:A Complete Reinforcement Learning System (Capstone)
2. Become a Deep Reinforcement Learning Expert
Learn the deep reinforcement learning skills that are powering amazing advances in AI. Then start applying these to applications like video games and robotics.
- :white_check_mark::100:Presented by UDACITY - Nanodegree Program
3. DeepMind x UCL | Deep Learning Lecture Series 2021
The Deep Learning Lecture Series is a collaboration between DeepMind and the UCL Centre for Artificial Intelligence.
- :white_check_mark::100:Presented by DeepMind & UCL
4. Advanced AI: Deep Reinforcement Learning in Python
Created by Lazy Programmer Team, Lazy Programmer Inc.
- The Complete Guide to Mastering Artificial Intelligence using Deep Learning and Neural Networks
5. Deep Reinforcement Learning, Decision Making and control
Lectures for UC Berkeley CS 285: Deep Reinforcement Learning for Fall 2021 - Instructor: Sergey Levine
:computer: Blogs
📍Deep Reinforcement Learning📍
:link: An introduction to Deep Q-Learning: let’s play Doom
:link: Deep Reinforcement Learning: Pong from Pixels :white_check_mark::100:Andrej Karpathy blog
📍Value-Based Methods📍
:link: Q-learning: a value-based reinforcement learning algorithm
:link: Deep Q Network(DQN)- Applying Neural Network as a functional approximation in Q-learning
:link: Diving deeper into Reinforcement Learning with Q-Learning
📍Policy-Based Methods📍
:link: REINFORCE — a policy-gradient based reinforcement Learning algorithm
:link: Policy Based Reinforcement Learning, the Easy Way
:link: Reinforcement Learning Policy for Developers
:link: An introduction to Policy Gradients with Cartpole and Doom
📍Value & Policy Based Methods📍
:link: Introduction to Actor Critic in Reinforcement Learning
:link: The Actor-Critic Reinforcement Learning algorithm
📍OpeniAi Gym📍
:link: Getting Started With OpenAI Gym: The Basic Building Blocks
📍Relevant & Attractive Content📍
:link: This Factory Robot Learns a New Job Overnight
:link: RL — Basics algorithms and terms
:link: The Complete Reinforcement Learning Dictionary
:gear: Projects
1- Space Invader
Note In this environment, the observation is an RGB image of the screen, which is an array of shape (210, 160, 3) Each action is repeatedly performed for a duration of k frames, where k is uniformly sampled from {2, 3, 4}. Our Target is to maximize our score.
2- Frozen Lake
Note Frozen lake involves crossing a frozen lake from Start(S) to Goal(G) without falling into any Holes(H) by walking over the Frozen(F) lake. The agent may not always move in the intended direction due to the slippery nature of the frozen lake.
3- Ping Pong
Note
4- Mountain Car
Note
5- Atari Breakout
Note
6- Gridworld Enviromnet
Note This recipe will walk you through the steps to build a Gridworld learning environment to train RL agents. Gridworld is a simple environment where the world is represented as a grid. Each location on the grid can be referred to as a cell. The goal of an agent in this environment is to find its way to the goal.
