brax
Massively parallel rigidbody physics simulation on accelerator hardware.
Hi team, I'd like to propose a small but helpful modification to the PPO training setup in Brax. Currently, in `brax/training/agents/ppo/networks.py`, the action distribution used by default is hardcoded to...
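The kind of change being proposed can be sketched in plain Python. Note that `make_ppo_networks`, `NormalTanhDistribution`, and the `distribution_factory` parameter below are illustrative stand-ins, not Brax's actual API:

```python
from typing import Callable

# Toy distribution classes standing in for a parametric action
# distribution; names are illustrative only.
class NormalTanhDistribution:
    def __init__(self, event_size):
        self.event_size = event_size

class NormalDistribution:
    def __init__(self, event_size):
        self.event_size = event_size

def make_ppo_networks(action_size,
                      distribution_factory: Callable = NormalTanhDistribution):
    # Instead of hardcoding one distribution, accept a factory so
    # callers can swap in a different parametric distribution while
    # keeping the old behavior as the default.
    return distribution_factory(event_size=action_size)

# Default keeps the original behavior; callers may override it.
default_dist = make_ppo_networks(8)
custom_dist = make_ppo_networks(4, distribution_factory=NormalDistribution)
```

Keeping the old class as the default argument means existing call sites are unaffected by such a change.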
I encountered this bug when running the MuJoCo Playground tutorial with the following command: `python learning/train_jax_ppo.py --env_name CartpoleBalance`. This command effectively runs `brax/training/agents/ppo/train.py`. I resolved the bug by referring...
This change ensures that the original system (env.sys) remains unchanged during JAX tracing. This prevents the UnexpectedTracerError that would occur when wrapping the same environment twice. Tests: - Training on...
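The idea behind the fix can be sketched in plain Python; the `System`, `Env`, and `wrap_for_training` names below are toy stand-ins for Brax's environment and physics system, not its real classes:

```python
import copy

# Toy stand-ins for a Brax environment and its physics System; the
# actual fix described above copies env.sys before wrapping so JAX
# tracing never mutates the original. Names are illustrative only.
class System:
    def __init__(self, gravity):
        self.gravity = gravity

class Env:
    def __init__(self, sys):
        self.sys = sys

def wrap_for_training(env):
    # Wrap a deep copy of the system: changes made during tracing
    # (e.g. by a randomization_fn) cannot leak back into env.sys,
    # so wrapping the same environment twice stays safe.
    return Env(copy.deepcopy(env.sys))

base = Env(System(gravity=-9.81))
train_env = wrap_for_training(base)
eval_env = wrap_for_training(base)  # second wrap of the same env
train_env.sys.gravity = 0.0         # simulate an in-trace mutation
```

Because each wrapper owns its own copy, mutating `train_env.sys` leaves both `base.sys` and `eval_env.sys` untouched.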
Hi, I've encountered an `UnexpectedTracerError` when using Brax's PPO implementation with the mjx backend under specific conditions: **Conditions:** - No additional `eval_env` provided. - Using a `randomization_fn` that depends on...
Hi Brax Team, I am unsure whether to ask this in the MuJoCo repository or here, so please excuse me if this question is misplaced :) I am looking to...
I am trying to run this simple code:
```python
import jax
from brax import envs

env_name = 'ant'
backend = 'mjx'
env = envs.get_environment(env_name=env_name, backend=backend)
print(env.observation_size)
print(env.action_size)
state = jax.jit(env.reset)(rng=jax.random.PRNGKey(seed=0))
```
which...
Hello brax team, thanks for your implementation of BC. I found some code problems when using it. I tried to fix them with AI, but I think...
Hello brax team, recently I've been trying to train a humanoid robot to squat. To be honest, it's hard to train with a pure PPO algorithm and self-defined reward functions. So, I'm...
In brax/envs/wrappers/training.py the EpisodeWrapper.step method correctly uses lax.scan to collect and sum rewards over action_repeat sub‑steps, but it never accumulates the corresponding per‑step state.metrics. After the scan it does: ```...
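The missing accumulation can be sketched in pure Python; the loop below stands in for `lax.scan`, and `episode_step`, `substep`, and the metric name are illustrative, not Brax's actual code:

```python
# Sketch of the fix the issue describes: like the reward, per-step
# state metrics should be summed across action_repeat sub-steps
# instead of keeping only the last sub-step's values.
def episode_step(state, action, substep, action_repeat):
    total_reward = 0.0
    totals = {k: 0.0 for k in state["metrics"]}
    for _ in range(action_repeat):  # stands in for lax.scan
        state = substep(state, action)
        total_reward += state["reward"]
        for k, v in state["metrics"].items():  # accumulate metrics too
            totals[k] += v
    return {**state, "reward": total_reward, "metrics": totals}

def substep(state, action):
    # Dummy physics step returning a fixed reward and metric.
    return {"reward": 1.0, "metrics": {"forward_velocity": 0.5}}

out = episode_step({"reward": 0.0, "metrics": {"forward_velocity": 0.0}},
                   None, substep, action_repeat=3)
```

With three sub-steps the sketch yields a summed reward of 3.0 and a summed metric of 1.5, whereas dropping the accumulation would report only the final sub-step's 0.5.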
Hi, there appears to be a bug in the latest version of Brax when training and saving a policy using PPO (commit `af646c6`). The error occurs during the save step of...