Reinforcement learning

A reference for Books, Articles, Courses and Educational Materials in this field

🗺️Roadmap . 📚Books · 📄Articles · 🎓Courses . 💻Blogs . ⚙️Projects

🐛Report Bug . ❓Ask Question . 👍Like the repo . 👎Dislike the repo

Note

:white_check_mark::100: it means, I have seen and tried them, I definitely recommend them to you

🔥 If you want to get free access to the books, courses and articles, get in touch with me with social media🔥

📣 You can help improve this repo, by giving me a ⭐star⭐ and ❤️following me❤️

❗️What is Reinforcement Learning(RL)❓️

Reinforcement learning (RL) is a field of machine learning concerned with how intelligent agents ought to take actions in an environment in order to maximize the notion of cumulative reward. Reinforcement learning is one of three basic machine learning paradigms, alongside supervised learning and unsupervised learning.

You can also read below blog to understand the key concepts in Reinforcement Learning

:link: https://medium.com/intro-to-artificial-intelligence/key-concepts-in-reinforcement-learning-2af715dfbfa

:world_map:Roadmap

Warning This is only a segguestion roadmap

:books: Books

*"Reinforcement Learning: An Introduction - Second Edition - Richard S. Sutton and Andrew G. Barto"

"Deep Reinforcement Learning with Python - Second Edition - Sudharsan Ravichandiran"

"Grokking Deep Reinforcement Learning - Miguel Morales"

"PyTorch 1.x Reinforcement Learning Cookbook - Yuxi (Hayden) Liu"

"Deep Reinforcement Learning Hands-On - 2nd Edition - Maxim Lapan"

"TensorFlow 2 Reinforcement Learning Cookbook - 2nd Edition - Praveen Palanisamy"

:page_facing_up: Articles

📍Sorted by Time📍

1958 Oct: Rosenblatt, F. "The perceptron: A probabilistic model for information storage and organization in the brain.."

1983 Oct: Andrew G. Barto, et al. "Neuronlike adaptive elements that can solve difficult learning control problems."

1988 Feb (TD): Sutton, R.S. "Learning to Predict by the Methods of Temporal Differences."

1992 May: Watkins, C.J.C.H., Dayan, P. "Q-learning". Mach Learn 8, 279–292 (1992)

1994 Nov: G. A. Rummery, M. Niranjan. "On-Line Q-Learning Using Connectionist Systems"

1995 Mar: Gerald Tesauro. "Temporal Difference Learning and TD-Gammon."

2005 Oct: Riedmiller, Martin. "Neural fitted Q iteration–first experiences with a data efficient neural reinforcement learning method."

2012 Jul (ALE): Bellemare, Marc G., et al. "The Arcade Learning Environment: An Evaluation Platform for General Agents."

2013 Dec (DQN): Volodymyr Mnih, et al. "Playing Atari with Deep Reinforcement Learning."

2015 Feb (DQN): Mnih, Volodymyr, et al. "Human-level control through deep reinforcement learning."

2015 Feb: Bernhard Schölkopf. "Learning to see and act." Nature518, pages486–487 (2015)

2015 Sep (DDQN): Hado van Hasselt, et al. "Deep Reinforcement Learning with Double Q-learning."

2015 Sep (DDPG): Lillicrap, Timothy P., et al. "Continuous control with deep reinforcement learning."

2015 Nov: Ziyu Wang, et al. "Dueling Network Architectures for Deep Reinforcement Learning."

2015 Nov (PER): Schaul, Tom, et al. "Prioritized Experience Replay."

2016 Jan: David Silver, et al. "Mastering the game of Go with deep neural networks and tree search."

2016 Jun: Brockman, Greg, et al. "Openai gym." arXiv preprint arXiv:1606.01540.

2016 Feb (A3C): Volodymyr Mnih, et al. "Asynchronous Methods for Deep Reinforcement Learning."

2017 Aug (PPO): Schulman, John, et al. "Proximal policy optimization algorithms."

2017 Oct: Hessel, Matteo, et al. "Rainbow: Combining Improvements in Deep Reinforcement Learning."

2018 Dec (SAC): Haarnoja, Tuomas, et al. "Soft Actor-Critic Algorithms and Applications."

2020 Oct: Xing Wua, et al. "Adaptive stock trading strategies with deep reinforcement learning methods."

2022 Mar: Wen, Y. et al. "Improving the exploration efficiency of DQNs via the confidence bound methods."

:mortar_board: Courses

1. Reinforcement Learning Specialization

Master the Concepts of Reinforcement Learning. Implement a complete RL solution and understand how to apply AI tools to solve real-world problems.

Offered By UNIVERSITY OF ALBERTA ALBERTA MACHINE INTELLIGENCE INSTITUTE
There are 4 Courses in this Specialization
:link:Fundamentals of Reinforcement Learning
:link:Sample-based Learning Methods
:link:Prediction and Control with Function Approximation
:link:A Complete Reinforcement Learning System (Capstone)

2. Become a Deep Reinforcement Learning Expert

Learn the deep reinforcement learning skills that are powering amazing advances in AI. Then start applying these to applications like video games and robotics.

:white_check_mark::100:Presented by UDACITY - Nanodegree Program

3. DeepMind x UCL | Deep Learning Lecture Series 2021

The Deep Learning Lecture Series is a collaboration between DeepMind and the UCL Centre for Artificial Intelligence.

:white_check_mark::100:Presented by DeepMind & UCL

4. Advanced AI: Deep Reinforcement Learning in Python

Created by Lazy Programmer Team, Lazy Programmer Inc.

The Complete Guide to Mastering Artificial Intelligence using Deep Learning and Neural Networks

5. Deep Reinforcement Learning, Decision Making and control

Lectures for UC Berkeley CS 285: Deep Reinforcement Learning for Fall 2021 - Instructor: Sergey Levine

:computer: Blogs

:gear: Projects

`1- Space Invader`

Note In this environment, the observation is an RGB image of the screen, which is an array of shape (210, 160, 3) Each action is repeatedly performed for a duration of k frames, where k is uniformly sampled from {2, 3, 4}. Our Target is to maximize our score.

`2- Frozen Lake`

Note Frozen lake involves crossing a frozen lake from Start(S) to Goal(G) without falling into any Holes(H) by walking over the Frozen(F) lake. The agent may not always move in the intended direction due to the slippery nature of the frozen lake.

`3- Ping Pong`

Note

`4- Mountain Car`

Note

`5- Atari Breakout`

Note

`6- Gridworld Enviromnet`

Note This recipe will walk you through the steps to build a Gridworld learning environment to train RL agents. Gridworld is a simple environment where the world is represented as a grid. Each location on the grid can be referred to as a cell. The goal of an agent in this environment is to find its way to the goal.

Deep-RL
Deep-RL copied to clipboard

Metadata

Reinforcement learning

A reference for Books, Articles, Courses and Educational Materials in this field

❗️What is Reinforcement Learning(RL)❓️

:world_map:Roadmap

:books: Books

:page_facing_up: Articles

📍Sorted by Time📍

:mortar_board: Courses

:computer: Blogs

📍Deep Reinforcement Learning📍

📍Value-Based Methods📍

📍Policy-Based Methods📍

📍Value & Policy Based Methods📍

📍OpeniAi Gym📍

📍Relevant & Attractive Content📍

:gear: Projects

`1- Space Invader`

`2- Frozen Lake`

`3- Ping Pong`

`4- Mountain Car`

`5- Atari Breakout`

`6- Gridworld Enviromnet`

← Metadata

Owner

Metadata

Deep-RL Deep-RL copied to clipboard

Metadata

Reinforcement learning

A reference for Books, Articles, Courses and Educational Materials in this field

❗️What is Reinforcement Learning(RL)❓️

:world_map:Roadmap

:books: Books

:page_facing_up: Articles

📍Sorted by Time📍

:mortar_board: Courses

:computer: Blogs

📍Deep Reinforcement Learning📍

📍Value-Based Methods📍

📍Policy-Based Methods📍

📍Value & Policy Based Methods📍

📍OpeniAi Gym📍

📍Relevant & Attractive Content📍

:gear: Projects

1- Space Invader

2- Frozen Lake

3- Ping Pong

4- Mountain Car

5- Atari Breakout

6- Gridworld Enviromnet

← Metadata

Owner

Metadata

Deep-RL
Deep-RL copied to clipboard

`1- Space Invader`

`2- Frozen Lake`

`3- Ping Pong`

`4- Mountain Car`

`5- Atari Breakout`

`6- Gridworld Enviromnet`