Shuvendu Roy

Results 10 comments of Shuvendu Roy

I tried 'Pong-V0' and it did not quite do well. Max reward I got was -2. What might have caused the problem?

As this environment is not currently available in gym, what should I change to reduce this 16 frame gap to 4

Ok, that's good But I am wondering about the shape of the state. According to the original paper ``` The details of the architecture are explained in the Methods. The...

Looks like something mysterious is happening. I also wonder how this even worked without sequence information. Any idea what is going on here? Are we Bruteforcing the model to learn...

ok!!! I am not quite getting the logic from code, where from this 4 frames are coming. As this env skips 4 frames, what is the situation now? Are we...

ow. I got it. Thanks :-)

With all this information with the modification, I trained the model. But could not quite regenerate the result as the original one. [Here ](https://github.com/ShuvenduBikash/Deep-reinforcement-learning/blob/master/q_learning/2.double_dqn/2.cnn_dqn.py)is the code and the result ![](https://github.com/ShuvenduBikash/Deep-reinforcement-learning/blob/master/q_learning/2.double_dqn/images/stacked_1m_frame.png?raw=true)...

FixMatch. Not sure any other have this issue.

@lowtronik that hex format. just decode it result.decode("utf-8", "replace")