reinforcement-learning icon indicating copy to clipboard operation
reinforcement-learning copied to clipboard

Personal experiments on Reinforcement Learning

Results 1 reinforcement-learning issues
Sort by recently updated
recently updated
newest added

In QAgent train(), there is `self.Q[s,a] = self.Q[s,a] + self.lr * (r + self.gamma*np.max(self.Q[s_next,a]) - self.Q[s,a])` but should be imho `self.Q[s,a] = self.Q[s,a] + self.lr * (r + self.gamma*np.max(self.Q[s_next,:]) -...