adversarial-policies

Find best-response to a fixed policy in multi-agent RL

Results 8 adversarial-policies issues

Hello, thanks for your interesting paper and code. I am really enjoying your work and I have a few small questions. Q1: Do you have any documentation explaining the main files, configurations...

It would be nice to make `modelfree.hyperparams.train_rl` a tune.Trainable rather than a function, adding checkpointing support. This would let us use the HyperBand and Population Based Training schedulers. Conceptually this...
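A minimal sketch of the shape such a refactor might take. To stay runnable without Ray installed, it defines a hypothetical `TrainRLSketch` class that only mirrors the method names of the class-based Tune interface (`setup`, `step`, `save_checkpoint`, `load_checkpoint`). A real port would subclass `ray.tune.Trainable`, and the exact checkpoint signatures depend on the Ray version. The training step is a placeholder and the `learning_rate` key is an assumption; neither is taken from `modelfree.hyperparams.train_rl`.

```python
import json
import os
import tempfile


class TrainRLSketch:
    """Hypothetical stand-in that mirrors the class-based Tune interface."""

    def setup(self, config):
        # Assumed hyperparameter name; the real config keys are not shown in the issue.
        self.learning_rate = config.get("learning_rate", 1e-3)
        self.timesteps = 0
        self.reward = 0.0

    def step(self):
        # Placeholder for one chunk of RL training; returns a metrics dict.
        self.timesteps += 1000
        self.reward += self.learning_rate * 1000
        return {"timesteps_total": self.timesteps, "episode_reward_mean": self.reward}

    def save_checkpoint(self, checkpoint_dir):
        # Persist everything needed to resume training later.
        path = os.path.join(checkpoint_dir, "state.json")
        with open(path, "w") as f:
            json.dump({"timesteps": self.timesteps, "reward": self.reward}, f)
        return checkpoint_dir

    def load_checkpoint(self, checkpoint_dir):
        # Restore the state written by save_checkpoint.
        with open(os.path.join(checkpoint_dir, "state.json")) as f:
            state = json.load(f)
        self.timesteps = state["timesteps"]
        self.reward = state["reward"]


def demo():
    """Train two steps, checkpoint, then resume in a fresh instance."""
    with tempfile.TemporaryDirectory() as ckpt_dir:
        first = TrainRLSketch()
        first.setup({"learning_rate": 0.01})
        first.step()
        first.step()
        first.save_checkpoint(ckpt_dir)

        resumed = TrainRLSketch()
        resumed.setup({"learning_rate": 0.01})
        resumed.load_checkpoint(ckpt_dir)
        return resumed.step()["timesteps_total"]
```

Schedulers that pause, clone, or stop trials early rely on this save/restore pair, which is why a plain training function without checkpointing does not fit them well.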

enhancement

- [ ] Use new format for VecNormalize: https://github.com/hill-a/stable-baselines/pull/525
- [ ] Switch to context manager to ensure policies are closed?
- [ ] Consider switching to `BasePolicy` rather than...
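The context-manager item in the checklist above could look roughly like the following sketch. `DummyPolicy` and `open_policies` are hypothetical names introduced here for illustration; the only assumption about real policies is that they expose a `close()` method.

```python
from contextlib import contextmanager


class DummyPolicy:
    """Hypothetical policy holding a resource that must be released."""

    def __init__(self):
        self.closed = False

    def predict(self, obs):
        if self.closed:
            raise RuntimeError("policy used after close()")
        return 0  # placeholder action

    def close(self):
        self.closed = True


@contextmanager
def open_policies(factories):
    """Create policies from zero-argument factories and always close them."""
    policies = []
    try:
        for make in factories:
            policies.append(make())
        yield policies
    finally:
        # Runs on normal exit and on exceptions, including a failure
        # while constructing a later policy in the list.
        for policy in policies:
            policy.close()
```

Usage would be along the lines of `with open_policies([DummyPolicy, DummyPolicy]) as (victim, adversary): ...`, so every policy is closed even if evaluation raises.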

https://github.com/HumanCompatibleAI/adversarial-policies/blob/3a273ea9b7a02c34f95917bb56c1473e9a1af3eb/src/modelfree/common/utils.py#L44 https://github.com/IDSIA/sacred/issues/498 was resolved.

To reproduce:

1. `docker run -it --env MUJOCO_KEY=URL_TO_YOUR_MUJOCO_KEY humancompatibleai/adversarial_policies:latest /bin/bash` (change the tag if built locally)
2. `ci/build_venv.sh`
3. The following error occurs: seems like something wrong with the...

I'm trying to reproduce the result but got stuck at the first step. It seems that some dependencies of stable-baselines3 are wrong? To reproduce this error, just clone this repo...

Thanks for your nice work! I am trying to reproduce this work by implementing it myself, but I have some questions about the win condition in Sumo Humans. I noticed the...

The base Docker image is no longer available on Docker Hub. Could you please update this repo with all dependencies fixed? ![image](https://github.com/HumanCompatibleAI/adversarial-policies/assets/123585894/b59d0d7f-2b88-4370-b645-ae8c1569f3c7)