Mava
Feat: Sebulba [3] ff-ippo
What?
Implement the Sebulba architecture with feedforward IPPO on Rware.
Why?
Integrate the Sebulba architecture because it is effective for environments that are not written in JAX or cannot be jitted.
How?
Extend the existing Cleanba code to support MARL algorithms and ensure compatibility with Mava's key components, including the logger and evaluator.
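
For reviewers unfamiliar with the pattern, here is a minimal sketch of the actor/learner split that Sebulba-style systems rely on. It is not the code in this PR: `ToyEnv`, `actor`, `learner`, and `run` are hypothetical names, the policy and the update are placeholders, and plain Python threads and queues stand in for the device-level machinery. It only illustrates how a non-jitted environment can be stepped in an actor thread while a learner consumes rollouts and publishes new parameters.

```python
import queue
import threading


class ToyEnv:
    """Hypothetical stand-in for a non-JAX multi-agent environment."""

    def __init__(self, num_agents):
        self.num_agents = num_agents
        self.t = 0

    def reset(self):
        self.t = 0
        return [0.0] * self.num_agents

    def step(self, actions):
        self.t += 1
        obs = [float(self.t)] * self.num_agents
        rewards = [1.0] * self.num_agents  # one reward per agent
        return obs, rewards


def actor(env, params_q, rollout_q, rollout_len, num_updates):
    """Step the (non-jitted) env in plain Python and ship rollouts to the learner."""
    obs = env.reset()
    for _ in range(num_updates):
        params = params_q.get()  # block until the learner publishes parameters
        traj = []
        for _ in range(rollout_len):
            actions = [params] * env.num_agents  # placeholder policy, not IPPO
            next_obs, rewards = env.step(actions)
            traj.append((obs, actions, rewards))
            obs = next_obs
        rollout_q.put(traj)


def learner(params_q, rollout_q, num_updates):
    """Consume rollouts and publish updated parameters."""
    params = 0
    params_q.put(params)
    total_reward = 0.0
    for update in range(num_updates):
        traj = rollout_q.get()
        total_reward += sum(sum(rewards) for _, _, rewards in traj)
        params += 1  # placeholder for a PPO update step
        if update < num_updates - 1:
            params_q.put(params)  # the actor needs no params after the last rollout
    return params, total_reward


def run(num_agents=2, rollout_len=4, num_updates=3):
    params_q = queue.Queue(maxsize=1)
    rollout_q = queue.Queue(maxsize=1)
    env = ToyEnv(num_agents)
    thread = threading.Thread(
        target=actor, args=(env, params_q, rollout_q, rollout_len, num_updates)
    )
    thread.start()
    result = learner(params_q, rollout_q, num_updates)
    thread.join()
    return result
```

This sketch uses a single actor and a strict hand-off between the two queues to keep it deterministic; an actual Sebulba setup would typically run several actor threads and let them overlap with learner updates.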
Extra
Before reviewing this PR, we need to merge #976 and #977. The early draft of Sebulba ff-ippo can be found in #959.

Action items:
- [ ] Fix mypy/pre-commit issues
- [ ] Fix loading a checkpointed model
- [ ] Benchmarking