Kevin
Results
1
issues of
Kevin
When training with different reward functions it's hard to compare 2 bots. A `callback` capable of running `n` games between current agent and another would prove useful to measure progress....