Mava icon indicating copy to clipboard operation
Mava copied to clipboard

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Results 160 Mava issues
Sort by recently updated
recently updated
newest added

**Describe the bug** When using my own instance of Reverb and Launchpad I can scale up to 10 executors. However, when I use Mava I can only scale to 6...

bug

### Feature Investigating using `reverb.TrajectoryDataset` for all recurrent systems. ### Proposal Change `make_dataset_iterator` to use `reverb.TrajectoryDataset` since it appears that TrajectoryDataset works better (faster trainer speed) when using the SequenceAdder....

enhancement

**Describe the bug** Both optimizers train the observation network separately which can caused the representation learnt to be unstable. Happens in MAD4PG. **Expected behavior** only one optimizer should work on...

bug

When trying to run ```bash python examples/meltingpot/train_on_substrates.py ``` The RAM usage continuously keeps on increasing till it's full and gets killed. Is this expected or is there a memory leak...

bug

**Describe the bug** The method [recurrent_n_step_critic_loss](https://github.com/instadeepai/Mava/blob/develop/mava/components/tf/losses/sequence.py#L26) is broken in its current implementation. This method currently does not update the critic network for the last bootstrap_n timesteps.

bug

**Please describe the purpose of the feature. Is it related to a problem? ** Adding Noisy Networks as an additional choice for exploration in MADQN - Noisy Networks have seen...

enhancement

## What? Added install instruction for people who want to develop for mava to readme. ## Why? I couldn't find this information easily and think it should be in the...

documentation
size/S
priority/low

### Please describe what `needs` to be maintained? We recently found an error in SMAC where we use `dict.values()` on a dictionary where the order of the returned values are...

maintenance

## What? Terminate an experiment when one of the nodes has an error. ## Why? This feature will be helpful in case of running multipple experiments, in case one of...

enhancement
size/M
priority/high

### Describe the bug The trainer keeps sampling even if the executors crash. This means that the trainer must somehow sample data that the executor is not generating. This might...

bug