adventures-in-ml-code
adventures-in-ml-code copied to clipboard
double/dueling Q learning
Should not the model be fitted (keras.fit(...)) and predicted (keras.predict(state)) in double Q learning (and also in dueling Q learning) examples? Seems you also forget to apply the same in the atari example. Do the graphs in the associated blog-posts are from actual experiments or should we expect something different? I mean have you tested your implementations?
Thanks