reinforcement-learning-an-introduction-chinese icon indicating copy to clipboard operation
reinforcement-learning-an-introduction-chinese copied to clipboard

(第2章) 2.5 追踪非平稳问题

Open PinkEx opened this issue 6 months ago • 1 comments

公式(7)下面的“注意,对于样本平均情况……”一行所说的“恒定步长参数的情况“下,α_n(α)=n应该改为α_n(α)=α?

PinkEx avatar Dec 19 '23 08:12 PinkEx