phasic-policy-gradient Add evaluation logging

Add evaluation logging

Open RobertKirk opened this issue 3 years ago • 0 comments

This commit adds periodic logging of evaluation scores for the policy being trained.

It also adds num_levels and start_level to the arguments.

Based on code from @rraileanu

Jan 16 '22 14:01 RobertKirk