cartpole topic
ilqr
Iterative Linear Quadratic Regulator with auto-differentiable dynamics models
essence
AutoDiff DAG constructor, built on NumPy and Cython. A Neural Turing Machine and a DeepQ agent run on it. Clean code for educational purposes.
Deep-Reinforcement-Learning-Algorithms
32 projects implementing Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project comes with a detailed training log.
safe-control-gym
PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and RL
Reinforcement-Learning-Pytorch-Cartpole
Simple CartPole example written in PyTorch.
Reinforcement_Learning
Implementations of basic reinforcement learning algorithms
ReinforcementLearning
Reinforcing Your Learning of Reinforcement Learning
dqn-lambda
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.