safeRL
Safe Reinforcement Learning algorithms
Cleaned up the code and restructured it into an object-oriented structure.

## Why?

Clutter in `agent2.py` made it almost impossible to debug.

## Test:

```
cd $RECOVERYPATH
python agent_trainer.py...
```
## Location:

https://github.com/Santara/safeRL/blob/c52382977616075971de68b56e031192e388ce6c/safe_recovery/agent_config.yml#L18-L19

## Issue:

Setting these options to `true` throws the following TensorFlow reuse error
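The error text itself is truncated above. A common cause in TensorFlow 1.x code is calling `tf.get_variable` for a name that already exists in a scope that was not opened with reuse enabled; this is an assumption about the error here, not confirmed by the snippet. The minimal pure-Python sketch below mimics that reuse check (the `VariableScope` class is hypothetical, not the repo's or TensorFlow's API):

```python
# Pure-Python sketch of the reuse check behind TF 1.x's
# "Variable ... already exists, disallowed" ValueError.
# VariableScope here is a toy stand-in, not TensorFlow code.

class VariableScope:
    def __init__(self, reuse=False):
        self.reuse = reuse
        self._vars = {}

    def get_variable(self, name, initial=0.0):
        if name in self._vars:
            if not self.reuse:
                # Mirrors the TF 1.x error when a scope tries to
                # re-create a variable without reuse enabled.
                raise ValueError(
                    f"Variable {name} already exists, disallowed. "
                    "Did you mean to set reuse=True?")
            return self._vars[name]  # reuse the existing variable
        self._vars[name] = initial
        return self._vars[name]


scope = VariableScope(reuse=False)
scope.get_variable("policy/w")
try:
    scope.get_variable("policy/w")   # second creation: raises
except ValueError as e:
    print("reuse error:", e)

# With reuse enabled (TF 1.x analogue:
# tf.variable_scope(..., reuse=tf.AUTO_REUSE)), the second call
# returns the existing variable instead of raising.
reusing = VariableScope(reuse=True)
reusing.get_variable("policy/w", initial=1.5)
assert reusing.get_variable("policy/w") == 1.5
```

If the real error matches this pattern, wrapping the second construction in `tf.variable_scope(..., reuse=tf.AUTO_REUSE)` is the usual fix; whether that applies here depends on the error text that was cut off.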
In the following line, the reward is specified as a list with a single element.

https://github.com/hari-sikchi/safeRL/blob/b4f0443b109d5d3290771528115087eb5dd763ce/safe_recovery/agent2.py#L352-L354

For multiple parallel rollouts, this list should be filled asynchronously by different instances of...
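The snippet cuts off before naming what would fill the list. As a hedged illustration only (the worker function, seeds, and rollout logic below are invented, not from `agent2.py`), asynchronous filling of a shared reward list by parallel rollout workers might look like this:

```python
# Hypothetical sketch: several parallel rollout workers each append
# their episode reward to a shared list, with a lock serializing writes.
import random
import threading

rewards = []                     # shared reward buffer, one entry per rollout
rewards_lock = threading.Lock()

def rollout_worker(seed):
    """Run one fake rollout and record its total reward."""
    rng = random.Random(seed)
    episode_reward = sum(rng.uniform(0, 1) for _ in range(10))
    with rewards_lock:           # avoid racing appends from other workers
        rewards.append(episode_reward)

threads = [threading.Thread(target=rollout_worker, args=(s,))
           for s in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(len(rewards))  # 4 rollouts -> 4 reward entries
```

`list.append` is atomic under CPython's GIL, but the explicit lock keeps the pattern correct if the critical section ever grows (e.g. appending reward plus trajectory together).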