
This is the official implementation of Multi-Agent PPO (MAPPO).

25 open issues in on-policy

Hi, I have an issue when reproducing the performance of simple_spread in MPE. The only modifications to your code are: 1. passing `--use_wandb` to disable wandb in `train_mpe.sh`, and 2. adding `self.envs.reset()`...
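Using `--use_wandb` to *disable* wandb is not a typo: in the repo's `config.py` the argument appears to be declared with `action='store_false'`, so passing the flag flips the default `True` to `False`. A minimal sketch of that pattern (the help text is paraphrased, not quoted):

```python
import argparse

parser = argparse.ArgumentParser()
# Default is True; passing --use_wandb on the command line stores False,
# which is why the flag *disables* wandb logging (tensorboard is used instead).
parser.add_argument("--use_wandb", action='store_false', default=True,
                    help="by default True, logs to the wandb server; "
                         "if the flag is passed, tensorboard is used instead")

args = parser.parse_args(["--use_wandb"])
print(args.use_wandb)  # False
```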

the function huber_loss in utils is like:

```python
def huber_loss(e, d):
    a = (abs(e) <= d).float()
    b = (e > d).float()
    return a*e**2/2 + b*d*(abs(e)-d/2)
```

It may come with a zero loss when error is...
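Presumably the problem being reported is that `b = (e > d).float()` only covers large *positive* errors: when `e < -d`, both indicator masks `a` and `b` are zero and the loss vanishes. A hedged sketch of the fix, using `abs(e)` in both masks to match the standard Huber loss:

```python
import torch

def huber_loss(e, d):
    """Standard Huber loss: quadratic for |e| <= d, linear beyond."""
    a = (abs(e) <= d).float()   # quadratic region
    b = (abs(e) > d).float()    # linear region (now also covers e < -d)
    return a * e**2 / 2 + b * d * (abs(e) - d / 2)

# With the original mask b = (e > d).float(), an error of e = -2*d
# would fall in neither region and contribute zero loss.
e = torch.tensor([-2.0])
print(huber_loss(e, d=1.0))  # tensor([1.5000]), not 0
```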

https://github.com/marlbenchmark/on-policy/blob/0483adc4b55233c649eece2458fe6fba367d26d9/onpolicy/envs/starcraft2/StarCraft2_Env.py#L560-L577 It should be `bad_transition = True` on line 563.
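For context, `bad_transition` is the usual flag for marking a time-limit truncation as distinct from a genuine terminal state, so the critic can still bootstrap from the final observation instead of using a zero terminal value. A hedged sketch of the pattern (names and structure are illustrative, not the repo's exact code):

```python
def end_of_step_info(episode_steps, episode_limit, battle_won, battle_lost):
    """Illustrative sketch of time-limit handling inside an env's step().

    When the episode ends only because the step budget ran out, the
    transition is 'bad': the return is truncated rather than terminal,
    so the learner should bootstrap V(s') for the value target.
    """
    done = battle_won or battle_lost
    truncated = (not done) and episode_steps >= episode_limit
    info = {"bad_transition": truncated}
    return done or truncated, info
```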

In the `config.py` file, there is this env parameter:

```python
parser.add_argument("--use_obs_instead_of_state", action='store_true',
                    default=False,
                    help="Whether to use global state or concatenated obs")
```

I would like to use a global state...
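As I read that flag, enabling it makes the centralized critic consume the concatenation of all agents' local observations instead of the environment-provided global state. A minimal sketch of that idea (a general illustration, not the repo's exact code path):

```python
import numpy as np

def build_share_obs(obs_list, env_state, use_obs_instead_of_state):
    """Hypothetical helper: choose the centralized critic's input.

    obs_list:  list of per-agent local observation arrays
    env_state: the environment's own global state vector
    """
    if use_obs_instead_of_state:
        return np.concatenate(obs_list, axis=-1)  # concatenated local obs
    return env_state                              # environment global state
```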

The output of python is listed here:

```
Traceback (most recent call last):
  File "train/train_football.py", line 203, in <module>
    main(sys.argv[1:])
  File "train/train_football.py", line 188, in main
    runner.run()
  File "/onpolicy/runner/shared/football_runner.py", line 43, in ...
```

Hello, thanks for open-sourcing such good work. I was wondering if you could also open-source the MASAC code base, as it would help to understand the variations of MASAC...

Hello! When agents have inconsistent action dimensions, how does MAPPO perform action masking?
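One common approach (a sketch under my own assumptions, not necessarily what the repo does for every env): pad every agent's action space to the largest dimension and mask the padding slots by pushing their logits to a large negative value before building the distribution, so heterogeneous agents can share one policy head.

```python
import torch
from torch.distributions import Categorical

def masked_categorical(logits, available_actions):
    """available_actions: 0/1 tensor of shape (batch, max_action_dim);
    padded (invalid) actions are marked 0."""
    masked_logits = logits.clone()
    masked_logits[available_actions == 0] = -1e10  # ~zero probability
    return Categorical(logits=masked_logits)

# Usage: an agent with only 3 real actions in a 5-dim padded space.
logits = torch.zeros(1, 5)
avail = torch.tensor([[1, 1, 1, 0, 0]])
dist = masked_categorical(logits, avail)
print(dist.probs)  # the two padding actions get ~0 probability
```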

Hi, I find something odd and I'd like to know if there's something I'm missing or if it's normal. In the buffers, you define the action_log_probs to have "act_shape" as...
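One general observation that may explain the shape (not a claim about the repo's exact buffer layout): for a discrete policy, a sampled action is stored as a single index and its log-probability is a single scalar, so the log-prob buffer only needs a trailing dimension of 1 even when the policy head outputs many logits.

```python
import torch
from torch.distributions import Categorical

dist = Categorical(logits=torch.zeros(4, 7))  # batch of 4 agents, 7 actions each
actions = dist.sample()                       # shape (4,): one index per agent
log_probs = dist.log_prob(actions)            # shape (4,): one scalar per action
print(actions.shape, log_probs.shape)
```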