Tianhong Dai
Tianhong Dai
distributed-ppo
This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).
google-football-pytorch
It's the pytorch implementation of google research football.
hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
integrated-gradient-pytorch
This is the pytorch implementation of the paper - Axiomatic Attribution for Deep Networks.
mosse-object-tracking
This is the implementation of MOSSE tracking algorithm (correlation filter based).
reinforcement-learning-algorithms
This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are st...
self-imitation-learning-pytorch
This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.