DRL_papernotes
Notes and comments about Deep Reinforcement Learning papers
Deep Reinforcement Learning papernotes
New Hierarchical-Learning section here.
2017-05
- Curiosity-driven Exploration by Self-supervised Prediction [arXiv]
2017-03
- Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning [arXiv]
- Virtual-to-real Deep Reinforcement Learning: Continuous Control of Mobile Robots for Mapless Navigation [arXiv]
2017-02
2017-01
- Deep Recurrent Q-Learning for Partially Observable MDPs [arXiv]
2016-12
- Playing Doom with SLAM-Augmented Deep Reinforcement Learning [arXiv]
- Learning to predict where to look in interactive environments using deep recurrent q-learning [arXiv]
2016-11
- Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates [arXiv]
- Reinforcement Learning with Unsupervised Auxiliary Tasks [arXiv]
- Learning to Navigate in Complex Environments [arXiv]
- Learning to reinforcement learn [arXiv]
2016-10
- Hybrid computing using a neural network with dynamic external memory [Nature]
- A Deep Hierarchical Approach to Lifelong Learning in Minecraft [arXiv]
- Towards Cognitive Exploration through Deep Reinforcement Learning for Mobile Robots [arXiv]
2016-09
- Target-driven Visual Navigation in Indoor Scenes using Deep Reinforcement Learning [arXiv]
- Playing FPS Games with Deep Reinforcement Learning [arXiv]
2016-08
- Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection [arXiv]
2016-05
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation [arXiv]
- Value Iteration Networks [arXiv]
2016-04
- End-to-End Training of Deep Visuomotor Policies [arXiv]
2016-02
- Prioritized Experience Replay [arXiv]
- Asynchronous Methods for Deep Reinforcement Learning [arXiv]
- Continuous control with deep reinforcement learning [arXiv]
- Graying the black box: Understanding DQNs [arXiv]
2015-12
- Deep Reinforcement Learning with Double Q-learning [arXiv]
- Deep Attention Recurrent Q-Network [arXiv]