papers icon indicating copy to clipboard operation
papers copied to clipboard

2018-05

  • Progress & Compress: A scalable framework for continual learning [arxiv] & [notes]
  • Playing hard exploration games by watching YouTube [arxiv] & [notes]

2018-04

  • DORA The Explorer: Directed Outreaching Reinforcement Action-Selection [arxiv] & [notes]
  • Gotta Learn Fast: A New Benchmark for Generalization in RL [arxiv] & [notes]

2018-03

  • An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling [arxiv] & [notes]
  • Generative Multi-Agent Behavioral Cloning [arxiv] & [notes]
  • World Models [arxiv] & [notes]
  • Semi-parametric Topological Memory for Navigation [arxiv] & [notes]
  • A disciplined approach to neural network hyper-parameters: Part 1 -- learning rate, batch size, momentum, and weight decay [arxiv] & [notes]

2018-02

  • Model-Ensemble Trust-Region Policy Optimization [arxiv] & [notes]

2018-01

  • Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor [arxiv] & [notes]

2017-08

  • A Brief Survey of Deep Reinforcement Learning [arxiv] & [notes]
  • Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control [arxiv] & [notes]

2017-07

  • Distral: Robust Multitask Reinforcement Learning [arxiv] & [notes]

2017-03

  • Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks [arxiv] & [notes]

2017-02

  • Cognitive Mapping and Planning for Visual Navigation [arxiv] & [notes]

2016-06