ICQ icon indicating copy to clipboard operation
ICQ copied to clipboard

Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS 2021 Spotlight https://arxiv.org/abs/2106.03400)

Results 6 ICQ issues
Sort by recently updated
recently updated
newest added

关于 actor和critic中softmax各自温度的选择,请问是否有一些大概的规律,比如说为什么critic的温度要比actor大这么多,再比如是否是多样性越高的数据集,温度应该设置的越高呢?

Can you give me total data of SMAC that you used in your experiments?

**'test_greedy': False,** 'test_interval': 20000, 'test_nepisode': 20, 'use_cuda': True, 'use_tensorboard': True}

Hi, Thanks for providing the code. When I run `python3 src/main.py --config=offpg_smac --env-config=sc2 with env_args.map_name=3s_vs_3z` I got a error about the tensor in different devices, when I used torch==1.1.0 or...

Hi, Thanks for providing the code. Other configurations (e.g., --config=qmix_smac) cannot run. I was wondering if you plan to fix these other configurations?

Dear author, thanks for making the code available. I have two questions regarding ICQ_softmax, where the weight is approximated by the softmax over the minibatch: 1. Why len(weights), e.g., [this...