Categorical DQN not training

Open sampreet-arthi opened this issue 4 years ago • 7 comments

Categorical DQN is not training. There may be shape-related errors, or we're missing something else. Failing that, the hyperparameters need to be tuned.

sampreet-arthi avatar Jul 31 '20 11:07 sampreet-arthi

If someone wants to try this out, feel free to do so.

sampreet-arthi avatar Aug 12 '20 21:08 sampreet-arthi

One issue is that the shapes are inferred using foo.reshape(-1, some_size) instead of being stated explicitly with foo.reshape(some_known_size, some_size), so a wrong shape never raises an error. It might take some playing around to find the problem.
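A minimal sketch of why this matters, using NumPy for brevity (the same reshape semantics apply to PyTorch tensors). The sizes and the variable names batch_size, n_envs, n_actions and n_atoms are assumptions chosen for illustration, not values taken from genrl:

```python
import numpy as np

# Hypothetical sizes, for illustration only.
batch_size, n_envs, n_actions, n_atoms = 4, 2, 3, 5

# Suppose an upstream bug drops one action, so the array has
# n_actions - 1 entries along that axis instead of n_actions.
wrong = np.zeros((batch_size, n_envs, n_actions - 1, n_atoms))

# Inferred leading dimension: the reshape succeeds and hides the bug.
inferred = wrong.reshape(-1, n_atoms)
print(inferred.shape)  # (16, 5), not the expected (24, 5)

# Explicit leading dimension: the same bug fails loudly at the reshape.
try:
    wrong.reshape(batch_size * n_envs * n_actions, n_atoms)
    explicit_failed = False
except ValueError:
    explicit_failed = True
print(explicit_failed)  # True
```

With the explicit size, the error appears at the line that makes the wrong assumption instead of surfacing later as a model that does not train.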

sampreet-arthi avatar Aug 12 '20 21:08 sampreet-arthi

The categorical_q_target function seems to be the main problem here. I tried reshaping and modifying it in several ways, but nothing worked. My guess is that, because both n_envs and batch_size are involved, there is a reshaping problem somewhere. It is not easy to tell which dimension refers to which variable while reshaping inside the function.
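One way to make the dimensions traceable is to name them up front and assert every input shape. The sketch below is not genrl's categorical_q_target. It is a NumPy version of the standard categorical (C51) projection step, and the function name project_distribution, its arguments and the (batch_size, n_envs, n_atoms) layout are all assumptions made for illustration:

```python
import numpy as np

def project_distribution(next_probs, rewards, dones, support, gamma):
    """Project the shifted distribution r + gamma * z back onto the fixed support.

    Assumed layout: next_probs is (batch_size, n_envs, n_atoms),
    rewards and dones are (batch_size, n_envs), support is (n_atoms,).
    """
    batch_size, n_envs, n_atoms = next_probs.shape
    assert rewards.shape == (batch_size, n_envs)
    assert dones.shape == (batch_size, n_envs)
    assert support.shape == (n_atoms,)

    v_min, v_max = support[0], support[-1]
    delta_z = (v_max - v_min) / (n_atoms - 1)

    # Flatten batch and env axes with an explicit size, never -1.
    n_rows = batch_size * n_envs
    probs = next_probs.reshape(n_rows, n_atoms)
    r = rewards.reshape(n_rows, 1)
    d = dones.reshape(n_rows, 1)

    # Bellman-shifted atoms, clipped to the support range.
    tz = np.clip(r + gamma * (1.0 - d) * support.reshape(1, n_atoms), v_min, v_max)
    # Fractional atom index of each shifted atom.
    b = np.clip((tz - v_min) / delta_z, 0, n_atoms - 1)
    lower = np.floor(b).astype(np.int64)
    upper = np.ceil(b).astype(np.int64)

    # Split each atom's mass between its two neighbours. When b is an
    # integer, lower == upper and all of the mass goes to that atom.
    w_upper = b - lower
    w_lower = 1.0 - w_upper

    target = np.zeros((n_rows, n_atoms))
    rows = np.broadcast_to(np.arange(n_rows).reshape(n_rows, 1), (n_rows, n_atoms))
    np.add.at(target, (rows, lower), probs * w_lower)
    np.add.at(target, (rows, upper), probs * w_upper)

    return target.reshape(batch_size, n_envs, n_atoms)


# Hypothetical usage with small, made-up sizes.
batch_size, n_envs, n_atoms = 3, 2, 5
support = np.linspace(-1.0, 1.0, n_atoms)
next_probs = np.full((batch_size, n_envs, n_atoms), 1.0 / n_atoms)
rewards = np.full((batch_size, n_envs), 0.5)
dones = np.zeros((batch_size, n_envs))
dones[0, 1] = 1.0  # one terminal transition
target = project_distribution(next_probs, rewards, dones, support, gamma=0.9)
```

Each slice target[i, j] should still sum to 1, and for the terminal transition all of the mass should land on the atom at 0.5, since the reward falls exactly on the support. Checks like these are a quick way to catch a batch and env axis mix-up.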

sampreet-arthi avatar Aug 21 '20 22:08 sampreet-arthi

Is this still an issue?

ajaysub110 avatar Aug 29 '20 04:08 ajaysub110

Yes

Sharad24 avatar Aug 29 '20 06:08 Sharad24

Can I take this up?

ajaysub110 avatar Aug 29 '20 11:08 ajaysub110

Sure, go ahead.

Sharad24 avatar Aug 29 '20 11:08 Sharad24