Pytorch-DPPO on advantages

on advantages

Open: cn3c3p opened this issue 7 years ago • 1 comment

After testing your PPO and comparing it with another implementation, I think your advantages need to be normalized: (advantages - advantages.mean()) / advantages.std(). For your reference.

Sep 25 '17 01:09 cn3c3p
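
For readers following the thread, the normalization suggested in the comment above can be sketched in plain Python as below. This is only an illustration under stated assumptions, not the repository's code: the function name `normalize_advantages`, the sample batch, and the small `eps` term are hypothetical additions that do not appear in the original comment.

```python
import statistics


def normalize_advantages(advantages, eps=1e-8):
    """Shift a batch of advantages to zero mean and scale it to unit std."""
    mean = statistics.fmean(advantages)
    # Sample standard deviation (n - 1 denominator); needs at least two values.
    # This matches the default behaviour of torch.Tensor.std().
    std = statistics.stdev(advantages)
    # eps is an assumption added here to avoid dividing by zero
    # when every advantage in the batch is identical.
    return [(a - mean) / (std + eps) for a in advantages]


# Hypothetical batch of advantage estimates.
batch = [1.0, 2.0, 3.0, 4.0]
normalized = normalize_advantages(batch)
print(normalized)
```

On a PyTorch tensor the same idea would presumably be written as `(advantages - advantages.mean()) / (advantages.std() + eps)`, applied to each batch before computing the PPO surrogate loss.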

Thanks for the notification, I will try this normalization. Can I ask which implementation you compared against?

Sep 26 '17 13:09 alexis-jacq
