PHC Request for Guidance on Humanoid Evaluation Instability

Request for Guidance on Humanoid Evaluation Instability

Open cold-deuu opened this issue 3 months ago • 1 comments

Hi ZhengyiLuo,

First, thanks for releasing the code. I trained and evaluated H1 with the following commands:

the train command:

python phc/run_hydra.py \
  project_name=Robot_IM \
  robot=unitree_h1 \
  env=env_im_h1_phc env.motion_file=sample_data/sample_dance_h1.pkl \
  learning=im_pnn_big \
  exp_name=unitree_h1_pnn_realsim_092924 \
  sim=robot_sim control=robot_control \
  learning.params.network.space.continuous.sigma_init.val=-1.7

and evaluation command

python phc/run_hydra.py \
  learning=im_pnn_big exp_name=unitree_h1_pnn_eun \
  epoch=-1 test=True \
  env=env_im_h1_phc robot=unitree_h1 \
  robot.freeze_hand=True robot.box_body=False \
  env.motion_file=sample_data/dance_sample_h1.pkl \
  env.num_envs=1 headless=False \
  control=robot_control

During evaluation/visualization, the robot sometimes gets “catapulted” off the ground as if it receives a strong contact impulse, and control breaks down.

I’m mainly wondering about two things:

Will this disappear with more training? I trained on a single A6000 GPU for 3 days. Is this instability something that typically resolves as the policy trains longer, or should I expect stability already at this stage?
Could this be a simulation setup issue? For example, ground/foot restitution, friction, max depenetration velocity, bounce threshold, timestep/substeps, or solver iterations causing large initial contact impulses.

If it’s neither of the above, I’d greatly appreciate any additional pointers you can share (e.g., a known-stable eval config for H1, recommended PD gains/action scaling at eval, foot geometry/material settings, or any differences between your training vs. eval physics settings that are important).

I can also share a short video if helpful. Thank you! Best regards, Seulchan Lee

https://github.com/user-attachments/assets/401f8d3f-4bf9-4300-bf0c-2367cd8117ec

Aug 10 '25 09:08 cold-deuu

PHC PHC copied to clipboard

Request for Guidance on Humanoid Evaluation Instability

PHC
PHC copied to clipboard