AAAI21-RoutineAugmentedPolicyLearning

Source code for the AAAI21 publication "Augmenting Policy Learning with Routines Discovered from a Single Demonstration".

Sec0: Introduction

This is the official code release for our AAAI21 paper, "Augmenting Policy Learning with Routines Discovered from a Single Demonstration".

Authors: Zelin Zhao (me), Chuang Gan, Jiajun Wu, Xiaoxiao Guo, Joshua Tenenbaum.

Work was done during Zelin’s internship at MIT.

Paper link:

Sec1: Installation

  1. Install Miniconda.
  2. Create an environment:
     conda create -n baselines python=3.7
  3. Install the libraries (gym is pinned to 0.10.5; see the troubleshooting note at the end):
     pip install tensorflow-gpu==1.14 ffmpeg-python matplotlib
     pip install gym==0.10.5
     pip install "gym[atari]"
  4. Install baselines:
     git clone
     cd baselines
     pip install -e .
  5. Install PyTorch:
     conda install pytorch torchvision -c soumith
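
After installation, it can help to confirm that the pinned packages were actually picked up by the environment. The following is a minimal sketch, not part of this repository: the helper names (`installed_version`, `matches_pin`, `report`) are hypothetical, and the pins are simply the versions mentioned in this README.

```python
# Hypothetical sanity check for the versions this README pins.
PINS = {"gym": "0.10.5", "tensorflow-gpu": "1.14"}


def installed_version(name):
    """Return the installed version string of a package, or None if absent."""
    try:
        from importlib import metadata  # available on Python 3.8+
    except ImportError:
        metadata = None
    if metadata is not None:
        try:
            return metadata.version(name)
        except metadata.PackageNotFoundError:
            return None
    import pkg_resources  # fallback for Python 3.7 (ships with setuptools)
    try:
        return pkg_resources.get_distribution(name).version
    except pkg_resources.DistributionNotFound:
        return None


def matches_pin(found, pin):
    """True if `found` begins with every dot-separated component of `pin`."""
    if found is None:
        return False
    want = pin.split(".")
    return found.split(".")[:len(want)] == want


def report(pins=PINS, lookup=installed_version):
    """Return {package: (pin, found)} for every package that misses its pin."""
    problems = {}
    for name, pin in pins.items():
        found = lookup(name)
        if not matches_pin(found, pin):
            problems[name] = (pin, found)
    return problems


if __name__ == "__main__":
    for name, (pin, found) in report().items():
        print("{}: expected {}, found {}".format(name, pin, found))
```

Comparing by components lets a pin such as `1.14` accept an installed `1.14.0`, which is what pip typically resolves for `tensorflow-gpu==1.14`.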

Sec3: Training expert policy

python --mode expert --seed 0

Sec4: Make a demonstration and abstract routines

python --mode abstraction --seed 0

Sec5: Train and test command

python --mode routine --seed 0
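
The three commands above share the same two flags, `--mode` and `--seed`. As an illustration only, the sketch below shows how a hypothetical entry point could parse them with `argparse`; the function name `parse_args` and the choice of defaults are assumptions, not the repository's actual code.

```python
import argparse

# The three stages, in the order the sections above list them.
MODES = ("expert", "abstraction", "routine")


def parse_args(argv=None):
    """Parse the --mode and --seed flags used by the commands in this README."""
    parser = argparse.ArgumentParser(
        description="Hypothetical entry point: pick a stage and a random seed."
    )
    parser.add_argument("--mode", choices=MODES, required=True,
                        help="which stage to run")
    parser.add_argument("--seed", type=int, default=0,
                        help="random seed for the run")
    return parser.parse_args(argv)


if __name__ == "__main__":
    args = parse_args()
    print("stage:", args.mode, "seed:", args.seed)
```

Restricting `--mode` with `choices` means a misspelled stage name fails immediately with a usage message instead of partway through a run.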

Troubleshooting

  1. ValueError: Cannot feed value of shape (1, 210, 160, 12) for Tensor 'Placeholder:0', which has shape '(?, 84, 84, 4)'

    This error indicates a gym version mismatch. Please make sure that gym 0.10.5 is installed.
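
The two shapes in the error message can be read as follows, assuming the usual DeepMind-style Atari preprocessing (grayscale, resize to 84x84, stack of 4 frames): the network expects preprocessed frames, while the value being fed looks like four raw 210x160 RGB frames stacked along the channel axis. The sketch below only reproduces that shape arithmetic; the names are illustrative and it does not call gym or TensorFlow.

```python
RAW_FRAME = (210, 160, 3)   # raw Atari frame: height, width, RGB channels
WARPED_FRAME = (84, 84, 1)  # assumed preprocessing: grayscale, resized to 84x84


def stacked_shape(frame_shape, n_stack=4):
    """Shape after concatenating n_stack frames along the channel axis."""
    height, width, channels = frame_shape
    return (height, width, channels * n_stack)


def can_feed(value_shape, placeholder_shape):
    """Mimic a placeholder shape check: None (printed as '?') matches any size."""
    if len(value_shape) != len(placeholder_shape):
        return False
    return all(p is None or p == v
               for v, p in zip(value_shape, placeholder_shape))


placeholder = (None, 84, 84, 4)
unprocessed = (1,) + stacked_shape(RAW_FRAME)      # (1, 210, 160, 12)
preprocessed = (1,) + stacked_shape(WARPED_FRAME)  # (1, 84, 84, 4)

print(can_feed(unprocessed, placeholder))   # False
print(can_feed(preprocessed, placeholder))  # True
```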