Kevin

Results 1 issues of Kevin

When training with different reward functions it's hard to compare 2 bots. A `callback` capable of running `n` games between current agent and another would prove useful to measure progress....