Fix bootstrapped DQN

Open Kaixhin opened this issue 9 years ago • 5 comments

The test on Beam Rider is failing badly, and does not look promising.

Kaixhin, Apr 18 '16 10:04

Hi @Kaixhin, did you try using my layer?

iassael, Apr 18 '16 10:04

@iassael a couple of questions about your layer. Can it use more complicated heads (like the dueling head)? How does it handle picking a new head for each new episode vs. taking the mode of the heads' actions in ensemble mode (during evaluation)? Is it possible to train with the "full" version of the bootstrap, where each head requires a separate experience replay memory?
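To make the questions concrete, here is a minimal sketch of the three pieces being asked about: sampling one head per training episode, voting across heads during evaluation, and a per-transition mask as a cheaper stand-in for separate replay memories. All function names and parameters (`sample_head`, `ensemble_action`, `bootstrap_mask`, `p`) are hypothetical and are not taken from the layer discussed here; the tie-breaking rule in the vote is also an assumption.

```python
import random
from collections import Counter


def sample_head(num_heads, rng):
    """Pick one head uniformly at random (intended to be called once per training episode)."""
    return rng.randrange(num_heads)


def greedy_action(q_values):
    """Return the index of the largest Q-value (the first index wins ties)."""
    return max(range(len(q_values)), key=lambda a: q_values[a])


def ensemble_action(q_per_head):
    """Evaluation: every head votes for its greedy action; return the mode of the votes."""
    votes = Counter(greedy_action(q) for q in q_per_head)
    best = max(votes.values())
    # Assumption: break ties toward the lowest action index so the result is deterministic.
    return min(a for a, count in votes.items() if count == best)


def bootstrap_mask(num_heads, p, rng):
    """Per-transition mask: head k trains on the transition only if mask[k] == 1.

    Storing this mask alongside each transition in one shared replay memory
    approximates giving every head its own memory.
    """
    return [1 if rng.random() < p else 0 for _ in range(num_heads)]


if __name__ == "__main__":
    rng = random.Random(0)
    head = sample_head(10, rng)           # head followed for the whole episode
    mask = bootstrap_mask(10, 0.5, rng)   # stored with a transition
    q = [[0.1, 0.9, 0.0], [0.2, 0.8, 0.1], [0.7, 0.1, 0.2]]
    print(head, mask, ensemble_action(q))
```

With `p = 1.0` every head sees every transition, so the heads differ only through their initialisation; smaller values of `p` move closer to the separate-memory version.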

Kaixhin, Apr 18 '16 10:04

hey @Kaixhin currently nope. For the former we could pass the module as a parameter, and for the latter it should be super easy to extend it with an extra parameter of the episode id.

iassael, Apr 25 '16 18:04

@iassael I'm focusing on some of the other components at the moment so I'm not sure I'll get to this any time soon, but feel free to give it a shot if you can.

Kaixhin, Apr 25 '16 18:04

@Kaixhin I'll keep you posted, and thanks for the awesome work. Cheers~

iassael, Apr 25 '16 23:04