deep-rl-class
deep-rl-class copied to clipboard
[HANDS-ON BUG] Two different variables for one value
Describe the bug
In unit 2 in the section "Monte Carlo vs Temporal Difference Learning" the learning rate is discussed. It's denoted lr (in the text) as well as alpha (in the pictures/slides). I'd recommend to use the same symbol which makes it easier to understand.