Mava
Mava copied to clipboard
🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX
### What do you want to test? Unit test for the `MAPGEpochUpdateSeparateNetworks` and `MAPGMinibatchUpdateSeparateNetworks` components of the Jax PPO implementation that makes use of separate critic and policy netowrks. ###...
### What do you want to test? Unit test for the `MAPGWithTrustRegionStepSeparateNetworks` component of the Jax PPO implementation that makes use of separate critic and policy netowrks. ### Outline of...
### What do you want to test? Unit test for the `MAPGWithTrustRegionClippingLossSeparateNetworks` component of the Jax PPO implementation that makes use of separate critic and policy netowrks. ### Outline of...
## What? Updates to current Jax MAPPO implementation to match the Tensorflow MAPPO implementation. ## Why? We firstly want to be able to compare the Jax system against the Tensorflow...
## What? Full distributed Jax mappo system integration test. ## Why? Part of on going efforts to increase testing code coverage of callback redesigned Mava. ## Extra closes #542 In...
### Feature A clear and concise description of the feature to be implemented and why. ### Proposal A plan or brief overview of how the feature will be implemented. ###...
## What? Executor integration test ## Why? Part of on going efforts to increase testing code coverage of callback redesigned Mava. ## Extra Close #643
### What do you want to investigate? In `mava/systems/jax/launcher.py`, the launcher allows to add nodes with the same name in the case of multi-process and does not allow it in...
### Please describe what needs to be maintained. step.py has too many components, some of which are independent i.e. `Step` and `TrainerStep` both extend `Component` ### Describe the outcome you'd...
### Please describe what needs to be maintained The Guardrails PR https://github.com/instadeepai/Mava/pull/596 highlighted some non-breaking issues that we should revisit. data:image/s3,"s3://crabby-images/9d645/9d6453579f2a7160cf166fb032f1e2226f9cdc1b" alt="180448888-344909c9-53fb-4f86-8df3-291f0723da39" - `Step` and `EpochUpdate` are both dependent on each...