Deep_reinforcement_learning_Course icon indicating copy to clipboard operation
Deep_reinforcement_learning_Course copied to clipboard

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

Results 48 Deep_reinforcement_learning_Course issues
Sort by recently updated
recently updated
newest added

I have implemented the DQN space invader notebook in Google Colab, as well as in jupyter notebooks. Like clockwork it crashes at 12 episodes, due to a memory (RAM) overflow....

I was training the same code that you have provided, But it is getting killed after 17 episodes. What can I infer from this? This happened 3 times! Model Saved...

I've trained the model for 50 total episodes. However, when I run the last code cell, the action is always the same. I've printed Qs and the action, and the...

Hi Thomas, (Since this issue got resolved without any proper answer, I'm submitting it again.) I don't understand that why we are doing tf.reduce_sum and multiple the network output to...

Hi Simon I am looking at your implementation of the PPO model. After going through the code a couple of times I think in the implementation, although you created two...

Hi, Im just training DQN and notice that most of training loops will use the `range()` function to iterate, and save in every 5 episode via a judgement `if episode...

Hey. Shouldn't `self.Q = tf.reduce_sum(tf.multiply(self.output, self.actions_))` in DQN class be `self.Q = tf.reduce_sum(tf.multiply(self.output, self.actions_), axis=1)`, i.e. reduced along columns so that the output length of `self.Q` is equal to the...

Was not sure why code is not restoring mean and rms values I made mods as follows so on restart it can pickup where it left off. Is there a...

I've run both pong and space-invaders for long training sets without any issues...but when I try breakout the system hangs some where inside breakout it never finishes the last life...

reset not called in MaxAndSkipEnv..must be specified as: ``` def reset(self): return self.env.reset() ``` without args.