isql
isql copied to clipboard
Inferring beliefs about dynamics from behavior
Inverse Soft Q-Learning
Learn an agent's beliefs about the dynamics of the environment from their control demonstrations.
Usage
Install Python dependencies with the pip package manager using
pip install -r requirements.txt
Install gym (version 0.9.2) and replace gym/envs/box2d/lunar_lander.py
with isql/lunar_lander.py
.
Install baselines and replace baselines/baselines/deepq/{build_graph|simple}.py
with isql/{build_graph|simple}.py
.
Extract this zip file into isql/
.
Questions and comments
Please contact the author at sgr [at] berkeley [dot] edu
if you have questions or find bugs. Be warned that not all cells are in order, so you might have to hunt around to deal with NameErrors.
Citation
If you find this software useful in your work, we kindly request that you cite the following paper:
@InProceedings{Reddy/etal/18b,
title={Where Do You Think You're Going?: Inferring Beliefs about Dynamics from Behavior},
author={Reddy, Siddharth and Dragan, Anca D. and Levine, Sergey},
booktitle={Arxiv 1805.08010},
year={2018},
url={https://arxiv.org/abs/1805.08010}
}