MPO icon indicating copy to clipboard operation
MPO copied to clipboard

Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments

Results 1 MPO issues
Sort by recently updated
recently updated
newest added

I have enjoyed your really clean implementation of MPO. Thank you for making it available. I was looking at the critic update and think I may have spotted a bug....