basic_reinforcement_learning about the code

about the code

Open mazzzystar opened this issue 7 years ago • 1 comments

In tutorial1, qlearn_mod_random.pyline 32:

if random.random() < self.epsilon:
            minQ = min(q)
            mag = max(abs(minQ), abs(maxQ))
            # add random values to all the actions, recalculate maxQ
            q = [q[i] + random.random() * mag - .5 * mag for i in range(len(self.actions))]
            maxQ = max(q)

why use this(versus qlearn.py)?

Apr 15 '17 03:04 mazzzystar

I reconstructed your code in a more configurable way if your pardon. The link is mycode, and the question above is still bother me, I appreciate so much if you can give an interpretation.

Apr 27 '17 14:04 mazzzystar

basic_reinforcement_learning basic_reinforcement_learning copied to clipboard

about the code

basic_reinforcement_learning
basic_reinforcement_learning copied to clipboard