rlai-exercises
rlai-exercises copied to clipboard
Exercise Solutions for Reinforcement Learning: An Introduction [2nd Edition]
Hi, Hector I am referring to the second edition of the book. Exercise 2.4 If the step-size parameters, αn, are not constant, then the estimate Q n is a weighted...
Resolves #4 by updating Exercises 2.7 and 2.8
## Exercises missing Two new exercises (2.7 and 2.8) appear in the latest version of the book available online and they don't in the repository. Check [the book online](http://incompleteideas.net/book/the-book-2nd.html) for...
In my local machine, I am unable to download the libraries testbed which is showing error. When using estimators it says `ImportError: cannot import name 'SampleAverageEstimator' from 'estimators'` These packages...
## Expected Behavior Running GradientBandit shouldn't raise any overflow warnings. ## Current Behavior The warning "RuntimeWarning: overflow encountered in double_scalars" is raised occasionally in the following lines: estimators.py:110 `updated_numerical_preference[action_selected] =...