Moritz Zanger

Results: 2 issues by Moritz Zanger

Hi, I've been wondering whether the code for the approximate trust region update of the critic (via clipping losses) is a little more convoluted than it has to be. Specifically,...

Hi, I am observing strange behavior in the default TensorFlow boot dqn agent that has me a bit baffled. When running sweeps over multiple environments, the agent loses...