rl-book-challenge icon indicating copy to clipboard operation
rl-book-challenge copied to clipboard

self-studying the Sutton & Barto the hard way

Results 1 rl-book-challenge issues
Sort by recently updated
recently updated
newest added

The function np.argmax, does not cut ties randomly: eg: Q = [1,0,1] np.argmax always picks the first 1. Whereas all ties while selecting should be cut randomly. This can lead...