
Minimal and Clean Reinforcement Learning Examples

39 reinforcement-learning issues, sorted by recently updated

Bumps [numpy](https://github.com/numpy/numpy) from 1.12.1 to 1.22.0. Release notes (sourced from numpy's releases, v1.22.0): NumPy 1.22.0 is a big release featuring the work of 153 contributors spread...

dependencies

Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 1.0.0 to 2.7.2. Release notes (sourced from tensorflow's releases, TensorFlow 2.7.2): this release introduces several vulnerability fixes, including a fix for a code injection in saved_model_cli (CVE-2022-29216). Fixes...

dependencies

Bumps [pillow](https://github.com/python-pillow/Pillow) from 4.1.0 to 9.0.1. Release notes (sourced from pillow's releases, 9.0.1): https://pillow.readthedocs.io/en/stable/releasenotes/9.0.1.html. Changes: in show_file, use os.remove to remove temporary images (CVE-2022-24303, #6010) [@radarhere, @hugovk]. Restrict builtins within...

dependencies

I am running the script [here](https://github.com/rlcode/reinforcement-learning/blob/master/2-cartpole/3-reinforce/cartpole_reinforce.py) but even after 500 episodes it does not converge. You can see the graph I get below: ![score](https://user-images.githubusercontent.com/14084682/129976668-ec03b85d-e44f-429c-a59a-7a7354be4b72.png) In contrast this is the supposedly...
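
A common first thing to check when this REINFORCE example fails to converge is how the discounted returns are computed and normalized; the sketch below shows that step in isolation (the function and variable names are illustrative and not taken from cartpole_reinforce.py):

```python
import numpy as np

def discounted_returns(rewards, gamma=0.99):
    """Compute normalized discounted returns G_t for one episode.

    Normalizing the returns (zero mean, unit variance) is a common
    stabilizer for REINFORCE; without it the policy gradient can be
    noisy enough that CartPole never converges.
    """
    returns = np.zeros_like(rewards, dtype=np.float64)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    returns -= returns.mean()
    returns /= (returns.std() + 1e-8)
    return returns

# Example: rewards from a short episode
print(discounted_returns([1.0, 1.0, 1.0, 1.0]))
```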

Hello, I saw there was a run.py script for running the example in the README, but I can't find that script. How can I get the full code example? Thanks, regards.

Any idea how to go about implementing diagonal movement in the grid world example?
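
One way to support diagonal movement is to extend the action set from four (dx, dy) offsets to eight; a minimal sketch of that idea, assuming a simple coordinate-based environment rather than the repo's actual grid-world code:

```python
# Eight-directional action set: 4 cardinal + 4 diagonal moves.
# Indices 0-3 follow a typical up/down/left/right layout; 4-7 are diagonals.
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1),
           (-1, -1), (-1, 1), (1, -1), (1, 1)]

GRID_SIZE = 5

def step(state, action_idx):
    """Apply an action to a (row, col) state and clip to the grid boundaries."""
    dr, dc = ACTIONS[action_idx]
    row = min(max(state[0] + dr, 0), GRID_SIZE - 1)
    col = min(max(state[1] + dc, 0), GRID_SIZE - 1)
    return (row, col)

# Example: move diagonally down-right from the corner
print(step((0, 0), 7))  # -> (1, 1)
```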

I want to run another Atari game, but its performance doesn't look good. Could anyone help me? Can I achieve this by changing "gym.make('ENV_NAME')" and its real_action? Help me...
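
Switching to another Atari game usually does start with changing the id passed to gym.make, but the real_action mapping is game-specific and has to be rebuilt from the new game's action meanings. A hedged sketch (the environment id and mapping below are assumptions for illustration, not the repo's actual values):

```python
import gym

# Swap in another Atari id here (assumption: 'PongDeterministic-v4';
# the Breakout example in this repo uses a different id).
env = gym.make('PongDeterministic-v4')

# Each game assigns different meanings to its discrete actions, so a
# hard-coded real_action mapping from Breakout will not carry over.
print(env.unwrapped.get_action_meanings())
# e.g. ['NOOP', 'FIRE', 'RIGHT', 'LEFT', 'RIGHTFIRE', 'LEFTFIRE']

# Rebuild the agent-action -> env-action mapping for the new game
# (illustrative choice: keep NOOP plus the two movement actions).
real_action = [0, 2, 3]
```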

Create plots and further extensions

Hi, I would like to test some hyperparameters using threading, which would be much faster. But when I run the DQN and DDQN algorithms with threading, the error says: Seems...
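
One likely cause is that a Keras/TensorFlow 1.x model is not safe to share across Python threads (each thread needs its own graph and session), so for a hyperparameter sweep it is often simpler to run each configuration in its own process. A minimal sketch, assuming a hypothetical train_dqn entry point rather than the repo's actual training code:

```python
from multiprocessing import Pool

def train_dqn(learning_rate):
    """Placeholder for one training run; in practice this would build the
    DQN/DDQN agent and environment inside the process, so that each run
    gets its own TensorFlow graph and session."""
    # ... build agent, train, and return the final score ...
    return {'lr': learning_rate, 'score': 0.0}

if __name__ == '__main__':
    learning_rates = [1e-2, 1e-3, 1e-4]
    # One process per configuration avoids sharing a Keras model
    # (and its TF graph) across threads.
    with Pool(processes=len(learning_rates)) as pool:
        results = pool.map(train_dqn, learning_rates)
    print(results)
```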