Mava
🦁 A research-friendly codebase for fast experimentation with multi-agent reinforcement learning in JAX
Title says it all. On TPUv3, try setting `num_eval_episodes = 1`
### Feature There have been many discussions about moving our Python tooling to [ruff](https://github.com/astral-sh/ruff), as has been done in many popular libraries in the community. This will be a big once-off effort,...
### Please describe the purpose of the feature. Is it related to a problem? It would be very nice if the evaluator function was generic enough for future algorithms. It...
As discussed in #994 we should now be able to use the `MetricsWrapper` and the `TrainState` in the evaluator instead of manually recording the episode return and creating a special...
### Describe the bug It seems like we are not using the correct termination vs truncation values: we always use the condition `termination or truncation` (`timestep.last()`) when we often want to...
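To illustrate the distinction the issue above is pointing at: in dm_env/Jumanji-style timesteps, `timestep.last()` is true for both termination and truncation, while the discount distinguishes the two (a discount of zero signals true termination, so only then should the bootstrap value be zeroed). A minimal sketch of that per-step logic, using hypothetical names (`split_done_flags` is not part of Mava's API):

```python
def split_done_flags(is_last: bool, discount: float) -> tuple[bool, bool]:
    """Split a combined 'done' signal into (terminated, truncated).

    `is_last` corresponds to `timestep.last()`: true on the final step of
    an episode, whether it ended naturally or was cut off by a time limit.
    A zero discount on that final step marks a true termination; a nonzero
    discount marks a truncation, where bootstrapping should still happen.
    """
    terminated = is_last and discount == 0.0
    truncated = is_last and discount != 0.0
    return terminated, truncated
```

In a value target, `terminated` would zero out the next-state value while `truncated` would keep it, which is exactly the behaviour that collapsing both cases into `timestep.last()` loses.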
## What? Implement the Sebulba architecture with feedforward IPPO on RWARE. ## Why? Integrate Sebulba's architecture for its effectiveness with non-jitted/non-JAX environments. ## How? Enhance the existing [Cleanba](https://github.com/vwxyzjn/cleanba)...
### Please describe what needs to be maintained? This came up in #955. Unfortunately it's quite messy to check the equality of two jaxmarl spaces as they don't have custom...
Having the `LogWrapper` and `LogEnvState` means that we often have to do `state.env_state`, which is quite confusing. Just tracking the logging metrics separately in their own state would make...
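One way to read the suggestion above: rather than a wrapper state that nests the real environment state (forcing `state.env_state` everywhere), the episode metrics can live in their own flat structure carried alongside the env state. A minimal stdlib-only sketch, with hypothetical names (`EpisodeMetrics` and `step_metrics` are illustrative, not Mava's actual types):

```python
from typing import NamedTuple


class EpisodeMetrics(NamedTuple):
    """Running per-episode logging state, kept separate from the env state."""
    episode_return: float = 0.0
    episode_length: int = 0


def step_metrics(metrics: EpisodeMetrics, reward: float, done: bool):
    """Accumulate return/length; on episode end, emit the finished
    episode's metrics and reset the running state.

    Returns (completed_episode_or_None, new_running_metrics).
    """
    running = EpisodeMetrics(
        episode_return=metrics.episode_return + reward,
        episode_length=metrics.episode_length + 1,
    )
    if done:
        return running, EpisodeMetrics()  # fresh state for the next episode
    return None, running
```

With this shape the training loop threads `EpisodeMetrics` next to the env state instead of inside it, so the env state is always accessed directly.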
### Feature Because we have a separate actor and critic, we need to update both networks separately, which leads to a lot of code duplication,...
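To sketch how the duplication above could be factored out: bundle each network's parameters with its own update rule, then drive both through one generic loop instead of two near-identical update functions. This is a stdlib-only illustration under assumed names (`TrainComponent`, `apply_updates` are hypothetical, not Mava's API; real code would hold optimiser states and use jitted gradient updates):

```python
from typing import Any, Callable, Dict, NamedTuple


class TrainComponent(NamedTuple):
    """A network's parameters paired with its own update rule."""
    params: Any
    update: Callable[[Any, Any], Any]  # (params, batch) -> new params


def apply_updates(
    components: Dict[str, TrainComponent], batch: Any
) -> Dict[str, TrainComponent]:
    """Run every component's update in one generic loop, so actor and
    critic share a single code path instead of duplicated update logic."""
    return {
        name: comp._replace(params=comp.update(comp.params, batch))
        for name, comp in components.items()
    }


# Toy usage: stand-in update rules in place of real gradient steps.
actor = TrainComponent(params=1.0, update=lambda p, b: p + b)
critic = TrainComponent(params=2.0, update=lambda p, b: p * b)
state = apply_updates({"actor": actor, "critic": critic}, batch=3.0)
```

The design choice is that each network owns its loss/optimiser pairing, so adding a third network (e.g. for a new algorithm) means registering one more component rather than copying another update function.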
Previously it was not possible to use a CNN in Mava's recurrent systems. This PR makes CNNs compatible with recurrent systems and adds a relevant config for a recurrent CNN...