DeepRL-TensorFlow2 icon indicating copy to clipboard operation
DeepRL-TensorFlow2 copied to clipboard

"PPO_Continuous.py" trained 1000 EP without effect

Open Synmul opened this issue 5 years ago • 3 comments

[No changes have been made to the code. tensorflow version is 2.2, will this affect it? 20200610215857

Synmul avatar Jun 10 '20 14:06 Synmul

Similar thing happened to me . I tried A2C continuous for pendulum without any change (except total episode was set to 3000) but reward is still varies between -1000 to -0 , it rarely goes to -0. So i tried A2C discrete without any change for cartpole and again it is too slow to train ..

MedhaviMonish avatar Jul 01 '20 09:07 MedhaviMonish

I received the same results - PPO continuous doesn't appear to learn anything. I'm running TF2.3, so it doesn't have to do with your version @Synmul

natetsang avatar Jun 18 '21 22:06 natetsang

Same here. No changes

alifrahmatullah avatar Feb 17 '22 01:02 alifrahmatullah

@Synmul did you close because it was fixed?

natetsang avatar Dec 28 '23 15:12 natetsang