agents
agents copied to clipboard
DQN sample - AverageReturn output is same as AverageEpisodeLength
I have ran sample: https://github.com/tensorflow/agents/tree/942db59044f2b25151f313dc9a098ff652ab90f2/tf_agents/agents/dqn/examples/v2
Apparently AverageReturn always equal AverageEpisodeLength. Potential bug?
INFO:absl:
AverageReturn = 119.5999984741211
AverageEpisodeLength = 119.5999984741211
INFO:absl:step = 3000, loss = 2.203988
INFO:absl:403.487 steps/sec