Mava issues

Results: 160 Mava issues


[BUG] Inconsistent Logging of Jax Systems

### Describe the bug
JAX system logs look different from the TF system logs.
### To Reproduce
Steps to reproduce the behavior:
1. Run jax example.
2. Run tf example....

KaleabTessera

bug

[FEATURE] Automatic ordering of hook calls in mixin class

### Please describe the purpose of the feature. Is it related to a problem?
This is a nitpick issue which might not be needed at all. But if we can...

DriesSmit

enhancement

[BUG] Flatland jax docker image doesn't work

### Describe the bug
The current flatland docker image doesn't work. Error: `module 'jaxlib.xla_extension' has no attribute '__path__'`, due to the version of cloudpickle that is required by gym 0.14 (flatland...

KaleabTessera

bug

[INVESTIGATION] MADDPG/MAD4PG are slower than MAPPO in certain instances (pixel-based environments?)

### What do you want to investigate?
MADDPG/MAD4PG are both significantly slower than MAPPO in certain instances.
1. Run MADDPG on Coop pong/PCB Grid for n steps with...

AsadJeewa

bug

[INVESTIGATION] Updating DDPG with only the critic network yields better performance

### What do you want to investigate?
Updating DDPG with only the critic network yields better performance. For this to be carried out, we need to update the DDPG actor...

AsadJeewa

[FEATURE] Move tf.function in MAPPO

### Feature
For a small performance boost we should consider moving the tf.functions from [_minibatch_update](https://github.com/instadeepai/Mava/blob/develop/mava/systems/tf/mappo/training.py#L274) to [_step](https://github.com/instadeepai/Mava/blob/develop/mava/systems/tf/mappo/training.py#L285).
### Testing
Validate that the trainer is running faster with this change. To...

DriesSmit

enhancement

[BUG] Net Spec Keys is overwritten in MADDPG

### Describe the bug
In the MADDPG system, regardless of the net spec keys given to the `network_factory` (i.e., `create_default_networks`), the system overwrites these values. This causes issues when trying...

EdanToledo

bug

[FEATURE] piecewise linear epsilon for exploration

### Feature
A piecewise linear scheduler for epsilon. With a piecewise linear scheduler, the user can increase and decrease epsilon over the desired time intervals.
### Proposal
Creating a new...

nima-siboni

enhancement
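A minimal sketch of what such a piecewise linear epsilon schedule could look like, written in plain Python. The class name `PiecewiseLinearEpsilon` and the `(step, epsilon)` breakpoint format are assumptions made for illustration; they are not taken from the issue or from Mava's API.

```python
from bisect import bisect_right
from typing import Sequence, Tuple


class PiecewiseLinearEpsilon:
    """Hypothetical scheduler: interpolates epsilon linearly between breakpoints."""

    def __init__(self, breakpoints: Sequence[Tuple[int, float]]) -> None:
        if not breakpoints:
            raise ValueError("at least one breakpoint is required")
        steps = [s for s, _ in breakpoints]
        if any(b <= a for a, b in zip(steps, steps[1:])):
            raise ValueError("breakpoint steps must be strictly increasing")
        self._steps = steps
        self._values = [float(v) for _, v in breakpoints]

    def __call__(self, step: int) -> float:
        # Outside the covered range, hold the first or last epsilon value.
        if step <= self._steps[0]:
            return self._values[0]
        if step >= self._steps[-1]:
            return self._values[-1]
        # Find the segment [steps[i], steps[i + 1]) that contains `step`.
        i = bisect_right(self._steps, step) - 1
        s0, s1 = self._steps[i], self._steps[i + 1]
        v0, v1 = self._values[i], self._values[i + 1]
        frac = (step - s0) / (s1 - s0)
        return v0 + frac * (v1 - v0)


# Epsilon decays from 1.0 to 0.1 over the first 100 steps,
# then rises back to 0.5 by step 200 and stays there.
schedule = PiecewiseLinearEpsilon([(0, 1.0), (100, 0.1), (200, 0.5)])
```

Because each segment is interpolated independently, the same structure covers both decreasing and increasing intervals, which is the behaviour the feature request asks for.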

[BUG] Executors set networks on the variable server.

### Describe the bug
The executors in MADDPG pass no `set_keys` argument to the variable client. This means that `set_keys` defaults to `get_keys`. We don't want the executors...

DriesSmit

bug
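The defaulting behaviour described in this report can be illustrated with a small stand-in class. `SketchVariableClient` below is hypothetical and is not Mava's variable client; only the rule "a missing `set_keys` falls back to `get_keys`" comes from the issue, and passing an explicit empty list is just one assumed way to avoid it.

```python
from typing import Dict, List, Optional


class SketchVariableClient:
    """Hypothetical stand-in for a variable client; not Mava's actual class."""

    def __init__(
        self,
        server: Dict[str, object],
        get_keys: List[str],
        set_keys: Optional[List[str]] = None,
    ) -> None:
        self._server = server
        self._get_keys = list(get_keys)
        # Behaviour described in the issue: no set_keys means set_keys = get_keys.
        self._set_keys = list(get_keys) if set_keys is None else list(set_keys)

    def set(self, local: Dict[str, object]) -> None:
        # Push every key in set_keys from the local copy to the server.
        for key in self._set_keys:
            self._server[key] = local[key]


stale = {"policy": "stale executor copy"}

# Executor built without set_keys: it overwrites the server's networks.
server_a = {"policy": "trainer weights"}
SketchVariableClient(server_a, get_keys=["policy"]).set(stale)

# Executor built with an explicit empty set_keys: the server is left untouched.
server_b = {"policy": "trainer weights"}
SketchVariableClient(server_b, get_keys=["policy"], set_keys=[]).set(stale)
```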

[BUG] Optimizer states are not being stored in the variable_sources.

### Describe the bug
This makes loading from a checkpoint and resuming training problematic as new optimizers are initialised each time.
### Solution
Add the optimizer states to the variable...

DriesSmit

bug
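A toy illustration of why a checkpoint without optimizer state changes the resumed run. It uses a hand-written SGD-with-momentum update on f(x) = x**2 and `pickle` as the checkpoint format; every name here is hypothetical and none of it reflects how Mava stores variables.

```python
import pickle
from typing import Tuple


def momentum_step(
    param: float, velocity: float, grad: float, lr: float = 0.1, beta: float = 0.9
) -> Tuple[float, float]:
    """One SGD-with-momentum update; `velocity` is the optimizer state."""
    velocity = beta * velocity + grad
    return param - lr * velocity, velocity


def train(param: float, velocity: float, steps: int) -> Tuple[float, float]:
    for _ in range(steps):
        grad = 2.0 * param  # gradient of f(x) = x**2
        param, velocity = momentum_step(param, velocity, grad)
    return param, velocity


# Reference: ten uninterrupted steps.
full_param, _ = train(1.0, 0.0, 10)

# Stop after five steps and write two checkpoints.
p5, v5 = train(1.0, 0.0, 5)
ckpt_with_opt = pickle.dumps({"param": p5, "opt_state": v5})
ckpt_without_opt = pickle.dumps({"param": p5})

# Resume with the stored optimizer state: matches the uninterrupted run.
restored = pickle.loads(ckpt_with_opt)
resumed_param, _ = train(restored["param"], restored["opt_state"], 5)

# Resume with a freshly initialised optimizer: the trajectory diverges.
restored = pickle.loads(ckpt_without_opt)
reset_param, _ = train(restored["param"], 0.0, 5)
```

Only the checkpoint that carries the optimizer state reproduces the uninterrupted result; the other resumes from the right parameters but with the momentum reset to zero.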
