[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
YyzHarry
TD-Gammon implementation
dellalibera