deep-rl-tensorflow Setting's of the Corridor game

Setting's of the Corridor game

Open huihuiqu opened this issue 8 years ago • 0 comments

Could you please tell me how did you set the reward at each state? It seems that all F states will receive an reward thus an agent might just keep staying on F states till episode ends and it will automatically receive max reward. I cannot reproduce the result of the dueling network's corridor game. Could you please give me any hints?

Feb 09 '17 08:02 huihuiqu

deep-rl-tensorflow deep-rl-tensorflow copied to clipboard

Setting's of the Corridor game

deep-rl-tensorflow
deep-rl-tensorflow copied to clipboard