rl-book-challenge
rl-book-challenge copied to clipboard
self-studying the Sutton & Barto the hard way
Results
1
rl-book-challenge issues
Sort by
recently updated
recently updated
newest added
The function np.argmax, does not cut ties randomly: eg: Q = [1,0,1] np.argmax always picks the first 1. Whereas all ties while selecting should be cut randomly. This can lead...