deep_rl

Code style: black

Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.

"Simplicity is the ultimate sophistication." - Leonardo da Vinci

| Algorithm | Implemented |
|---|---|
| A2C | :x: |
| ARS | :x: |
| C51 | :heavy_check_mark: |
| DDPG | :heavy_check_mark: |
| Double DQN | :x: |
| DQN | :heavy_check_mark: |
| Dueling DQN | :heavy_check_mark: |
| FQF | :x: |
| IQN | :heavy_check_mark: |
| RAINBOW | :x: |
| Recurrent PPO | :x: |
| REINFORCE | :heavy_check_mark: |
| PER (1) | :heavy_check_mark: |
| PPO | :heavy_check_mark: |
| QR-DQN | :x: |
| SAC | :heavy_check_mark: |
| TD3 | :x: |
| TQC | :x: |
| TRPO | :x: |

(1) Prioritized Experience Replay (PER), implemented with DQN.

Credit

Most implementations are adapted from vwxyzjn/cleanrl without being strictly equivalent. Other resources: toshikwa/fqf-iqn-qrdqn.pytorch

Contributions

Contributions are welcome.