Sean Fuhrman

Results 1 issues of Sean Fuhrman

### 🐛 Bug When MaskablePPO early exits due to target_kl, n_updates is still updated by 'self.n_epochs' instead being incremented only on successful epochs. Therefore if it early exits at epoch...

bug
good first issue
help wanted