reinforcement-learning icon indicating copy to clipboard operation
reinforcement-learning copied to clipboard

Minimal and Clean Reinforcement Learning Examples

Results 39 reinforcement-learning issues
Sort by recently updated
recently updated
newest added

From reinforcement-learning/2-cartpole/1-dqn/cartpole_dqn.py/train_model def train_model(self): if len(self.memory) < self.train_start: return batch_size = min(self.batch_size, len(self.memory)) mini_batch = random.sample(self.memory, batch_size) update_input = np.zeros((batch_size, self.state_size)) update_target = np.zeros((batch_size, self.state_size)) action, reward, done = [],...

Hello, trained agent play CartPole-v1 with score 500, but when I restart it with ... self.load_model from = True and with correct name, it start learning again with low score...

If I increase both the HEIGHT and WIDTH from 5 to 10 keeping the obstacles and the final goal at the same position, Deep SARSA network doesn't seem to converge....

Firstly, thanks for the great collection of code and articles. The articles were very useful in understanding DQN and implementing it. However, my code is very bad in learning. I...

Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 1.0.0 to 2.11.1. Release notes Sourced from tensorflow's releases. TensorFlow 2.11.1 Release 2.11.1 Note: TensorFlow 2.10 was the last TensorFlow release that supported GPU on native-Windows. Starting...

dependencies

Bumps [pillow](https://github.com/python-pillow/Pillow) from 4.1.0 to 9.3.0. Release notes Sourced from pillow's releases. 9.3.0 https://pillow.readthedocs.io/en/stable/releasenotes/9.3.0.html Changes Initialize libtiff buffer when saving #6699 [@​radarhere] Limit SAMPLESPERPIXEL to avoid runtime DOS #6700 [@​wiredfool]...

dependencies

I was here: https://github.com/rlcode/reinforcement-learning/tree/master/2-cartpole After the first image, the rest look like this: ![image](https://github.com/rlcode/reinforcement-learning/assets/10841261/88b45743-2ff9-44c8-8034-0616a4807c25) Code for those looks like this: ![image](https://github.com/rlcode/reinforcement-learning/assets/10841261/e68b816d-abe1-4afd-b21f-4262ebb1605f) In the repo, the file structure looks like this:...