pg_rnn

There are a few caveats when using a Recurrent Neural Network (RNN) policy with policy gradient algorithms. This repository explains them and provides a solution for them. Please see the blog f...

Policy Gradient with Recurrent Neural Network (RNN)

Modular implementation of the Vanilla Policy Gradient (VPG) algorithm with an RNN policy.
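
As a rough illustration of the quantity VPG optimizes, here is a minimal NumPy sketch of the discounted reward-to-go and the surrogate loss built from it. It is not taken from this repository: the function names `discounted_returns` and `vpg_surrogate_loss` and the default `gamma=0.99` are assumptions made for the example, and the log-probabilities are assumed to come from whatever policy is in use.

```python
import numpy as np


def discounted_returns(rewards, gamma=0.99):
    """Reward-to-go G_t = r_t + gamma * G_{t+1} for a single episode.

    `gamma` is an illustrative default, not a value from the repository.
    """
    returns = np.zeros(len(rewards), dtype=np.float64)
    running = 0.0
    # Walk the episode backwards so each step reuses the return of the next one.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def vpg_surrogate_loss(log_probs, returns):
    """Negative mean of log pi(a_t | s_t) * G_t.

    Minimizing this with respect to the policy parameters follows the
    vanilla policy gradient estimate (no baseline is subtracted here).
    """
    log_probs = np.asarray(log_probs, dtype=np.float64)
    returns = np.asarray(returns, dtype=np.float64)
    return -np.mean(log_probs * returns)


if __name__ == "__main__":
    print(discounted_returns([1.0, 1.0, 1.0], gamma=0.5))
    print(vpg_surrogate_loss([-1.0, -2.0], [2.0, 1.0]))
```

In an autodiff framework such as TensorFlow, the same loss would be built from the policy's log-probability tensors so that gradients flow back into the RNN weights.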

Dependencies

Python 2.7 or 3.5
TensorFlow 1.10
gym
numpy
tqdm progress-bar

Features

An RNN policy that outputs the action probabilities for a reinforcement learning problem
A sampler that reshapes trajectories so they can be fed into the RNN policy
Gradient clipping to mitigate the exploding gradient problem
A GRU cell to mitigate the vanishing gradient problem
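
Two of the features above, reshaping trajectories for an RNN and clipping gradients, can be sketched in plain NumPy. This is a hedged illustration rather than the repository's sampler or optimizer code: `pad_trajectories` and `clip_by_global_norm` are hypothetical helpers, and the `[batch, time, obs_dim]` layout with a padding mask is an assumption about how the episodes are batched.

```python
import numpy as np


def pad_trajectories(trajectories):
    """Stack variable-length episodes into one [batch, max_time, obs_dim] array.

    Each trajectory is a sequence of observation vectors. Shorter episodes
    are zero-padded; `mask` marks the real time steps with 1.0 so padded
    steps can be excluded from the loss.
    """
    lengths = np.array([len(t) for t in trajectories], dtype=np.int64)
    obs_dim = np.asarray(trajectories[0]).shape[1]
    max_time = int(lengths.max())
    batch = np.zeros((len(trajectories), max_time, obs_dim), dtype=np.float32)
    mask = np.zeros((len(trajectories), max_time), dtype=np.float32)
    for i, traj in enumerate(trajectories):
        batch[i, :lengths[i]] = np.asarray(traj, dtype=np.float32)
        mask[i, :lengths[i]] = 1.0
    return batch, mask, lengths


def clip_by_global_norm(grads, max_norm):
    """Rescale all gradients together so their joint L2 norm is at most max_norm."""
    global_norm = np.sqrt(sum(np.sum(np.square(g)) for g in grads))
    # The scale is 1.0 when the norm is already within the limit.
    scale = max_norm / max(global_norm, max_norm)
    return [g * scale for g in grads], global_norm


if __name__ == "__main__":
    episodes = [np.ones((3, 4)), np.ones((5, 4))]
    batch, mask, lengths = pad_trajectories(episodes)
    print(batch.shape, mask.sum(axis=1), lengths)

    clipped, norm = clip_by_global_norm([np.array([3.0]), np.array([4.0])], 1.0)
    print(norm, clipped)
```

Scaling every gradient by the same factor preserves the update direction, which is why clipping by the global norm is often preferred over clipping each value independently. TensorFlow 1.x offers `tf.clip_by_global_norm` for the same purpose; whether this repository uses that call is not stated here.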

Usage

To train a model for CartPole-v0:

$ python run_pg_rnn.py

To view the training progress in TensorBoard:

$ tensorboard --logdir .

Results

[Figures: TensorBoard; Progress Bar]

About

reinforcement-learning

recurrent-neural-networks

policy-gradient

18

Stars

2

Forks

Watchers

Owner

abhishm
