robust_RL_multi_adversary
We investigate the effect of adversary populations on finding good solutions to the robust MDP problem.
Hello! I want to get a better understanding of this code, so I need the distribution over actions, such as the mean and standard deviation of the Gaussian distribution....
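In RLlib you can usually pull the distribution parameters out of the policy's extra action fetches. Below is a minimal sketch, assuming an already-built trainer (e.g. a PPOTrainer) with a diagonal-Gaussian action distribution; the fetch key differs across Ray versions, so both common names are tried:

```python
import numpy as np

# `trainer` and `obs` are assumed to exist already (a built RLlib trainer
# and a single environment observation).
action, _, extras = trainer.compute_action(obs, full_fetch=True)

# Older Ray releases expose the Gaussian parameters as "behaviour_logits";
# newer ones use "action_dist_inputs". Both are [mean, log_std] concatenated.
dist_inputs = np.asarray(
    extras.get("action_dist_inputs", extras.get("behaviour_logits")))
mean, log_std = np.split(dist_inputs, 2)
print("mean:", mean, "std:", np.exp(log_std))
```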
Added Bernoulli Bandit and appropriately updated run scripts.
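For reference, a Bernoulli bandit is a tiny environment where each arm pays out 1 with a fixed probability. Here is an illustrative gym-style sketch; the arm probabilities and horizon are placeholders, not the repo's actual configuration:

```python
import gym
import numpy as np
from gym import spaces

class BernoulliBandit(gym.Env):
    """K-armed bandit where arm i pays reward 1 with probability probs[i]."""

    def __init__(self, probs=(0.2, 0.5, 0.8), horizon=100):
        self.probs = np.asarray(probs)
        self.horizon = horizon
        self.action_space = spaces.Discrete(len(probs))
        # Observations are a dummy constant; bandits are stateless.
        self.observation_space = spaces.Box(0.0, 1.0, shape=(1,), dtype=np.float32)
        self.t = 0

    def reset(self):
        self.t = 0
        return np.zeros(1, dtype=np.float32)

    def step(self, action):
        self.t += 1
        reward = float(np.random.rand() < self.probs[action])  # Bernoulli payout
        done = self.t >= self.horizon
        return np.zeros(1, dtype=np.float32), reward, done, {}
```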
The ma_crowd script fails for me with the following error:
```
File "/Users/kanaad/miniconda3/envs/sim2real/lib/python3.6/site-packages/ray/rllib/agents/trainer.py", line 415, in train
    raise e
File "/Users/kanaad/miniconda3/envs/sim2real/lib/python3.6/site-packages/ray/rllib/agents/trainer.py", line 401, in train
    result = Trainable.train(self)
File "/Users/kanaad/miniconda3/envs/sim2real/lib/python3.6/site-packages/ray/tune/trainable.py",...
```
This happens because of the import of `env_creator`.
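If the crash really is triggered at import time, one common workaround (a sketch, not the repo's actual fix) is to defer the import so it only runs inside the function that needs it:

```python
def make_env(env_config):
    # Lazy import: `env_creator` is only resolved when the env is actually
    # constructed, so merely importing this module can no longer fail.
    from envs import env_creator  # module path is an assumption
    return env_creator(env_config)
```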
The RAM usage of training grows steadily over time, eventually causing the experiment to fail. Why?
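One way to confirm and localize a leak, assuming a standard trainer.train() loop, is to log the driver process's resident memory every iteration with psutil (a diagnostic sketch, not the repo's code):

```python
import psutil

proc = psutil.Process()  # the current (driver) process
for i in range(200):  # iteration count is a placeholder
    result = trainer.train()  # `trainer` is assumed to exist
    rss_gb = proc.memory_info().rss / 1e9
    print("iter {}: rss = {:.2f} GB".format(i, rss_gb))
```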
The adversary applies its actions to the latents of the autoencoder.
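A hypothetical illustration of that idea: encode the observation, let the adversary perturb the latent vector within a bound, and decode the perturbed latent before the agent sees it. All names here (encoder, decoder, adversary_policy, eps) are assumptions, not the repo's API:

```python
import numpy as np

def adversarial_latent_step(obs, encoder, adversary_policy, decoder, eps=0.1):
    z = encoder(obs)                         # latent code of the observation
    delta = adversary_policy(z)              # adversary action in latent space
    z_adv = z + eps * np.clip(delta, -1, 1)  # bounded perturbation of latents
    return decoder(z_adv)                    # perturbed observation for the agent
```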
Currently the rollout test of a trained policy generates a video but does not clean it up afterwards.
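A minimal cleanup sketch, assuming the rollout writes the video to a known path; deleting it in a finally block keeps test artifacts from accumulating. Both run_rollout_and_record and video_path are hypothetical names:

```python
import os

video_path = "/tmp/rollout_test.mp4"  # placeholder path
try:
    run_rollout_and_record(video_path)  # hypothetical rollout helper
finally:
    # Remove the recorded video whether or not the rollout succeeded.
    if os.path.exists(video_path):
        os.remove(video_path)
```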