SA_DQN The same level of agents in the paper cannot be reproduced from the source code

Open RyanHaoranLi opened this issue 1 year ago • 0 comments

Dear authors, thank you for sharing this code; it is great work! However, when I train the robust agents with the PGD solver, I cannot reach the performance reported in the paper. Testing the trained agents gives only:

0.0 +- 0.0 average reward in the BankHeist environment.
0.0 +- 0.0 average reward in the RoadRunner environment.
21.46 +- 1.6150541786577934 average reward in the Freeway environment.
-21.0 +- 0.0 average reward in the Pong environment.

Could my environment configuration be the cause? Could you provide the exact versions of Python and the main packages (torch, gym, numpy, etc.)? Or what else might be the cause?
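A minimal sketch for collecting the versions in question, so they can be compared against the authors' setup. The helper name `collect_versions` is hypothetical and not part of the SA_DQN code, and the default package list (torch, gym, numpy) is only the set named above; it reads installed package metadata and reports "not installed" for anything missing.

```python
import platform
from importlib import metadata


def collect_versions(packages=("torch", "gym", "numpy")):
    """Map 'python' and each package name to its installed version string.

    Hypothetical helper for illustration; not part of the SA_DQN repository.
    """
    versions = {"python": platform.python_version()}
    for name in packages:
        try:
            # Reads the installed distribution's metadata without importing it.
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = "not installed"
    return versions


if __name__ == "__main__":
    for name, version in collect_versions().items():
        print(f"{name}: {version}")
```

Pasting this output alongside the reward numbers would make it easier to rule out a version mismatch before looking at other causes.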

Looking forward to hearing from you! Thank you!

Jun 29 '23 08:06 RyanHaoranLi
