Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Results 160 Mava issues

### Please describe the purpose of the feature. Is it related to a problem? A tutorial notebook for creating an IPPO system to accompany the Mava Jax release. ### Describe the...

enhancement

### Describe the benchmarking experiment/task Benchmark the Mava Jax IPPO system on the same MPE and SMAC tasks as in [this PPO paper](https://arxiv.org/abs/2103.01955). ### Hypothesis/expected behavior or outcome Match the...

benchmark in progress

### Please describe what needs to be maintained? Update the Jax IPPO readme before Jax release. ### Describe the outcome you'd like Jax IPPO readme has an outline of the...

documentation

### Please describe what needs to be maintained? Ensure that all Jax IPPO functions have detailed, descriptive docstrings and that the code is well commented before the Jax release.

documentation

### Please describe the purpose of the feature. Is it related to a problem? Create an automated pipeline to easily run experiments on the SMAC environments as used in this...

enhancement

### Describe the benchmarking experiment/task The main goal of the Mava JAX release is to match the performance of [this](https://arxiv.org/abs/2103.01955) paper. Our Notion page, documenting our progress, can be found...

benchmark in progress

### Please describe the purpose of the feature. Is it related to a problem? This feature would allow the critic network to utilise recurrent neural networks. Recurrent networks should improve...

enhancement

### Feature Make the executor have an environment loop instead of the other way around. It makes more intuitive sense to run the executor instead of the environment loop (which...

enhancement

### What do you want to investigate? Determine a better Mava standard that would make the executors and trainers more composable. We want to make it easy to add new...

enhancement

Are there any examples of how to wrap and use a PettingZoo environment in Mava?