chainerrl icon indicating copy to clipboard operation
chainerrl copied to clipboard

Observations passed to an agent must not be overwritten

Open toslunar opened this issue 7 years ago • 0 comments

Some agents e.g. chainerrl.agents.DQN store a state argument of act_and_train without copying. This implementation doesn't always work well, e.g. if env.step constantly returns an array like env._state that has a fixed buffer.

toslunar avatar Oct 23 '17 09:10 toslunar