pg_rnn
pg_rnn copied to clipboard

→

Metadata

There are few caveats when you want to use a Recurrent Neural Network (RNN) policy with Policy Gradient Algorithms. This repository explains them and provide a solution for them. Please see the blog f...

Reame
Issues

Results 2 pg_rnn issues

Sort by recently updated

About the reward

Hi. When I run your code, I find a mistake in run_pg_rnn.py. The last line print("reward is {0}".format(np.sum(episode["rewards"]))) should be print("reward is {0}".format(np.sum(episode["returns"]))) Besides, why is the reward always 0?

wangdan269

About gradient

Hello, I want to involve RL into my RNN model, your code is a great beginning to me. I have a question about the gradient calculation part in your '/pg_rnn.py'...

ShengleiH

About

There are few caveats when you want to use a Recurrent Neural Network (RNN) policy with Policy Gradient Algorithms. This repository explains them and provide a solution for them. Please see the blog f...

reinforcement-learning

recurrent-neural-networks

policy-gradient

18

Stars

2

Forks

Watchers

Owner

abhishm

← Metadata

18

Stars

2

Forks

Watchers

Owner

abhishm

Metadata

There are few caveats when you want to use a Recurrent Neural Network (RNN) policy with Policy Gradient Algorithms. This repository explains them and provide a solution for them. Please see the blog f...

Back

pg_rnn pg_rnn copied to clipboard

Metadata

About the reward

About gradient

← Metadata

Owner

Metadata

pg_rnn
pg_rnn copied to clipboard