pg_rnn
pg_rnn copied to clipboard
About the reward
Hi. When I run your code, I find a mistake in run_pg_rnn.py.
The last line print("reward is {0}".format(np.sum(episode["rewards"])))
should be print("reward is {0}".format(np.sum(episode["returns"])))
Besides, why is the reward always 0?