adventures-in-ml-code double/dueling Q learning

double/dueling Q learning

Open MGKhKhD opened this issue 5 years ago • 0 comments

Shouldn't the model be fitted (keras.fit(...)) and used for prediction (keras.predict(state)) in the double Q learning (and also the dueling Q learning) examples? It seems the same calls are missing from the Atari example. Are the graphs in the associated blog posts from actual experiments, or should we expect something different? In other words, have you tested your implementations?
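
To make the question concrete, here is a minimal sketch of the target computation a double Q update would usually need. It is not taken from the repository: the function name `double_q_targets`, its arguments, and the default `gamma` are all hypothetical, and plain Python lists stand in for the arrays that the predict calls would return.

```python
def double_q_targets(q_primary_next, q_target_next, q_primary_current,
                     actions, rewards, dones, gamma=0.95):
    """Build per-sample regression targets for one double Q learning batch.

    All names here are illustrative assumptions, not the repository's API.
    Each q_* argument is a list of rows, one row of Q values per sample.
    """
    # Start from the primary network's current estimates so that only the
    # entry for the action actually taken is changed.
    targets = [list(row) for row in q_primary_current]
    for i, action in enumerate(actions):
        # Action selection: argmax over the primary network's next-state values.
        n_actions = len(q_primary_next[i])
        best_next = max(range(n_actions), key=lambda a: q_primary_next[i][a])
        # Action evaluation: the target network scores the selected action.
        # Terminal transitions do not bootstrap.
        bootstrap = 0.0 if dones[i] else gamma * q_target_next[i][best_next]
        targets[i][action] = rewards[i] + bootstrap
    return targets
```

In a Keras version, the three Q inputs would presumably come from predict calls on the primary and target models, and the returned targets would be what gets passed to the fit call on the primary model.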

Thanks

Dec 26 '19 08:12 MGKhKhD
