PPO_Lagrangian_PyTorch
                                
                                
                                
                                    PPO_Lagrangian_PyTorch copied to clipboard
                            
                            
                            
                        Implementation of PPO Lagrangian in PyTorch
PPO Lagrangian Reproduction in Pytorch
Implementation of PPO Lagrangian from Benchmarking Safe Exploration in Deep Reinforcement Learning Paper (Ray et al, 2019) in PyTorch
python ppo.py
Results
- Reward Returns

 - Cost Returns (Cost limit=25)
