modular_rl icon indicating copy to clipboard operation
modular_rl copied to clipboard

Will dropout break out the final loss of ppo algorithm?

Open ppaanngggg opened this issue 7 years ago • 1 comments

If I add dropout layer to model, will it be a bad idea?

Any experiments there?

ppaanngggg avatar Sep 13 '17 03:09 ppaanngggg

I use eval model when explore environment, and use train model for policy, old policy and value model when training

ppaanngggg avatar Sep 13 '17 03:09 ppaanngggg