EthanCodesss

Results: 2 issues by EthanCodesss

Hi! First, in ppo.py `self.policy = self.loss = -self.policy_loss + self.value_loss - self.entropy_loss` you said 'Reduce sum over all sub-policies (where only the active sub-policy will be non-zero due to...

Hello! It appears that the references for the algorithm have not been correctly displayed or included in the documentation, which makes it challenging for me to trace back the original...