policy-gradient topic
CodeGAN
[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks :octocat:
Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
awesome-monte-carlo-tree-search-papers
A curated list of Monte Carlo tree search papers with implementations.
pytorch-learn-reinforcement-learning
A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems...
deep_trading
This project aims to select a supervised algorithm that can predict stock prices basing on historical data and use the predictor generated to form trading strategies.
imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
TAA-PG
Usage of policy gradient reinforcement learning to solve portfolio optimization problems (Tactical Asset Allocation).
tianshou
An elegant PyTorch deep reinforcement learning library.
btgym
Scalable, event-driven, deep-learning-friendly backtesting library