MPO
MPO copied to clipboard
Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments
Results
1
MPO issues
Sort by
recently updated
recently updated
newest added
I have enjoyed your really clean implementation of MPO. Thank you for making it available. I was looking at the critic update and think I may have spotted a bug....