reinforcement-learning issues

Why use self.batch_size instead of batch_size

1

From reinforcement-learning/2-cartpole/1-dqn/cartpole_dqn.py/train_model def train_model(self): if len(self.memory) < self.train_start: return batch_size = min(self.batch_size, len(self.memory)) mini_batch = random.sample(self.memory, batch_size) update_input = np.zeros((batch_size, self.state_size)) update_target = np.zeros((batch_size, self.state_size)) action, reward, done = [],...

JieMEI1994

Use traned agent

Hello, trained agent play CartPole-v1 with score 500, but when I restart it with ... self.load_model from = True and with correct name, it start learning again with low score...

Sat0ri

Tensorflow A3C implementation gridworld environment

akileshbadrinaaraayanan

save_graph dir was missing.

hiroyachiba

Failing to converge with increase in grid-size (Grid World)

5

If I increase both the HEIGHT and WIDTH from 5 to 10 keeping the obstacles and the final goal at the same position, Deep SARSA network doesn't seem to converge....

akileshbadrinaaraayanan

My code is very poor in learning 2048 game using Double DQN

2

Firstly, thanks for the great collection of code and articles. The articles were very useful in understanding DQN and implementing it. However, my code is very bad in learning. I...

codetiger

Bump tensorflow from 1.0.0 to 2.11.1

Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 1.0.0 to 2.11.1. Release notes Sourced from tensorflow's releases. TensorFlow 2.11.1 Release 2.11.1 Note: TensorFlow 2.10 was the last TensorFlow release that supported GPU on native-Windows. Starting...

dependabot[bot]

dependencies

Bump pillow from 4.1.0 to 9.3.0

Bumps [pillow](https://github.com/python-pillow/Pillow) from 4.1.0 to 9.3.0. Release notes Sourced from pillow's releases. 9.3.0 https://pillow.readthedocs.io/en/stable/releasenotes/9.3.0.html Changes Initialize libtiff buffer when saving #6699 [@radarhere] Limit SAMPLESPERPIXEL to avoid runtime DOS #6700 [@wiredfool]...

dependabot[bot]

dependencies

links in cartpole are broken

I was here: https://github.com/rlcode/reinforcement-learning/tree/master/2-cartpole After the first image, the rest look like this: ![image](https://github.com/rlcode/reinforcement-learning/assets/10841261/88b45743-2ff9-44c8-8034-0616a4807c25) Code for those looks like this: ![image](https://github.com/rlcode/reinforcement-learning/assets/10841261/e68b816d-abe1-4afd-b21f-4262ebb1605f) In the repo, the file structure looks like this:...

EngrStudent

reinforcement-learning
reinforcement-learning copied to clipboard

Metadata

Why use self.batch_size instead of batch_size

Use traned agent

Tensorflow A3C implementation gridworld environment

save_graph dir was missing.

Failing to converge with increase in grid-size (Grid World)

My code is very poor in learning 2048 game using Double DQN

Bump tensorflow from 1.0.0 to 2.11.1

Bump pillow from 4.1.0 to 9.3.0

links in cartpole are broken

← Metadata

Owner

Metadata

reinforcement-learning reinforcement-learning copied to clipboard

Metadata

← Metadata

Owner

Metadata

reinforcement-learning
reinforcement-learning copied to clipboard