DQN-Trading icon indicating copy to clipboard operation
DQN-Trading copied to clipboard

It seems training not working(rewards don't converge at all)

Open zhiyiZeng opened this issue 1 year ago • 2 comments

This repo is pretty awesome. I'm trying to run a basic demo, but the training process seems not working at all (rewards don't converge at all). However, the agent still outperforms B&H a lot.(even when the reward is negative! ) I'm confused by this situation. Is there an explanation about this?

The graph is rewards with training epochs=50. image

zhiyiZeng avatar Aug 26 '22 05:08 zhiyiZeng