Implicit-Q-Learning
Implicit-Q-Learning copied to clipboard
PyTorch implementation of the implicit Q-learning algorithm (IQL)
Implicit-Q-Learning (IQL)
PyTorch implementation of the implicit Q-learning algorithm IQL (Paper)
Currently only implemented for online learning. Offline RL version with D4RL will be updated soon.
Run
python train.py
Results
Continuous IQL
Pendulum
Discrete IQL
CartPole
Reference
Original JAX implementation: IQL
Help and issues:
Im open for feedback, found bugs, improvements or anything. Just leave me a message or contact me.
Author
- Sebastian Dittert
Feel free to use this code for your own projects or research.
@misc{IQL,
author = {Dittert, Sebastian},
title = {PyTorch Implementation of Implicit-Q-Learning (IQL)},
year = {2021},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/BY571/Implicit-Q-Learning}},
}