rlai-exercises issues

Results 5 rlai-exercises issues


Answer of exercise 2.4 is wrong

Hi Hector, I am referring to the second edition of the book. Exercise 2.4: If the step-size parameters, α_n, are not constant, then the estimate Q_n is a weighted...

AbhishekVarghese
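
For context on the exercise quoted above: the issue preview is truncated, so neither the repository's answer nor the reporter's objection is visible here. The sketch below only checks the standard unrolling of the incremental update Q_{n+1} = Q_n + α_n (R_n − Q_n), under which reward R_i receives weight α_i ∏_{j>i} (1 − α_j) and the initial estimate Q_1 receives ∏_j (1 − α_j). The function names and sample numbers are hypothetical and chosen for illustration.

```python
def run_updates(q1, rewards, step_sizes):
    """Apply Q <- Q + alpha * (R - Q) once per (reward, step size) pair."""
    q = q1
    for reward, alpha in zip(rewards, step_sizes):
        q = q + alpha * (reward - q)
    return q


def unrolled_weights(step_sizes):
    """Return (weight on Q_1, list of weights on each reward)."""
    reward_weights = []
    for i, alpha in enumerate(step_sizes):
        weight = alpha
        # Every later update shrinks this reward's contribution by (1 - alpha_j).
        for later_alpha in step_sizes[i + 1:]:
            weight *= 1 - later_alpha
        reward_weights.append(weight)
    initial_weight = 1.0
    for alpha in step_sizes:
        initial_weight *= 1 - alpha
    return initial_weight, reward_weights


# Arbitrary illustrative values (assumptions, not from the issue).
q1 = 10.0
rewards = [1.0, 0.0, 2.0, 5.0]
step_sizes = [0.5, 0.3, 0.9, 0.1]

incremental = run_updates(q1, rewards, step_sizes)
w0, ws = unrolled_weights(step_sizes)
closed_form = w0 * q1 + sum(w * r for w, r in zip(ws, rewards))
```

With any step sizes in [0, 1], the two values agree up to floating-point error and the weights sum to 1, so the estimate is still a weighted average.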

Update Exercises 2.7 and 2.8 to 2nd edition version

Resolves #4 by updating Exercises 2.7 and 2.8

adamlm

Add 2.7 and 2.8 from the most recent version of the book

## Exercises missing
Two new exercises (2.7 and 2.8) appear in the latest version of the book available online, but they are not in the repository. Check [the book online](http://incompleteideas.net/book/the-book-2nd.html) for...

iamhectorotero

good first issue

Unable to use packages estimators and testbed

On my local machine, I am unable to download the testbed library; the download fails with an error. When using estimators, it says `ImportError: cannot import name 'SampleAverageEstimator' from 'estimators'` These packages...

AmolGirishShah

Solve numpy overflow warning in GradientBandit.

## Expected Behavior
Running GradientBandit shouldn't raise any overflow warnings.

## Current Behavior
The warning "RuntimeWarning: overflow encountered in double_scalars" is raised occasionally in the following lines: estimators.py:110 `updated_numerical_preference[action_selected] =...

iamhectorotero

good first issue
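
The overflow issue above is shown only as a truncated preview, so the exact cause at estimators.py:110 is not visible. One common source of overflow in gradient bandit code is exponentiating large action preferences when computing the softmax. Assuming that is what happens here, a minimal sketch of the usual remedy is to subtract the largest preference before exponentiating, which leaves the probabilities unchanged. The `softmax` helper below is hypothetical and not taken from the repository; it uses only the standard library, and the same shift applies to `np.exp` on arrays.

```python
import math


def softmax(preferences):
    """Softmax over action preferences, shifted by the maximum for stability.

    Hypothetical helper for illustration; not the repository's code.
    """
    highest = max(preferences)
    # After the shift the largest exponent is 0, so exp() cannot overflow.
    exps = [math.exp(h - highest) for h in preferences]
    total = sum(exps)
    return [e / total for e in exps]


# Preferences this large would overflow a naive exp(h) / sum(exp(h)).
probabilities = softmax([1000.0, 1001.0, 999.0])
```

The shift works because multiplying numerator and denominator by exp(-highest) cancels out, so only the differences between preferences matter.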
