PPO_Lagrangian_PyTorch
PPO_Lagrangian_PyTorch copied to clipboard
Implementation of PPO Lagrangian in PyTorch
trafficstars
PPO Lagrangian Reproduction in Pytorch
Implementation of PPO Lagrangian from Benchmarking Safe Exploration in Deep Reinforcement Learning Paper (Ray et al, 2019) in PyTorch
python ppo.py
Results
- Reward Returns

- Cost Returns (Cost limit=25)
