SA_DQN

Cannot reproduce the agent performance reported in the paper from the source code

Open RyanHaoranLi opened this issue 1 year ago • 0 comments

Dear authors: Thanks for sharing this code, this is great work! However, when I train the robust agents with the PGD solver, I cannot reach the performance reported in the paper. When I test the trained agents, I only get:

  • 0.0 +- 0.0 average reward in the BankHeist environment.
  • 0.0 +- 0.0 average reward in the RoadRunner environment.
  • 21.46 +- 1.6150541786577934 average reward in the Freeway environment.
  • -21.0 +- 0.0 average reward in the Pong environment.

Could my environment setup be the cause? Could you provide the exact versions of Python and the main packages (torch, gym, numpy, etc.) that you used? If not, what else might explain the gap?
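For comparison, the versions in question could be collected with something like the sketch below. It is only an illustration: the helper name `collect_versions` is hypothetical and not part of the SA_DQN code, the package list is taken from the question above, and it assumes Python 3.8 or newer so that `importlib.metadata` is available.

```python
import platform
from importlib import metadata


def collect_versions(packages=("torch", "gym", "numpy")):
    """Return a dict mapping 'python' and each package name to its installed version.

    Hypothetical helper for this issue; not part of the SA_DQN repository.
    """
    versions = {"python": platform.python_version()}
    for name in packages:
        try:
            # Reads the version from the installed distribution's metadata
            # without importing the package itself.
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = "not installed"
    return versions


if __name__ == "__main__":
    for name, version in collect_versions().items():
        print(f"{name}: {version}")
```

Running the same script in both setups would make any version mismatch easy to spot.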

Looking forward to hearing from you! Thank you!

RyanHaoranLi, Jun 29 '23 08:06