proximal-policy-optimization topic
Contra-PPO-pytorch
Proximal Policy Optimization (PPO) algorithm for Contra
Super-mario-bros-PPO-pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
google-football-pytorch
It's the pytorch implementation of google research football.
reinforcement-learning-algorithms
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...
walk_the_blocks
Implementation of Scheduled Policy Optimization for task-oriented language grouding
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR)...