How to balance task_reward and style_reward

Open hanzhi0410 opened this issue 9 months ago • 2 comments

I want to use AMP training to walk on complex terrain. Have you tried this before? Which parameters do you think have a significant impact on terrain adaptability?

hanzhi0410, May 13 '24 07:05

Do you have any ideas? I am also trying to train walking/running on terrain. I am wondering about the composition of the task reward and how to balance the task reward against the style reward.

yinkangning0124, Jun 05 '24 04:06

This balance is difficult to quantify, and I believe it depends on many factors. During training, I therefore increased the style reward step by step, starting from zero, and chose a value by comparing the results of multiple experiments. I also found that an excessive style reward can hurt the robot's terrain robustness and its velocity-tracking performance.

hanzhi0410, Jun 05 '24 05:06
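
For anyone who wants to try the sweep described above, here is a minimal sketch of one way to set it up. It assumes the total reward is a linear blend of the task reward and the style reward controlled by a single weight, which is only one possible formulation. The names `blend_reward`, `style_weight_candidates`, and `style_weight` are hypothetical and are not taken from the AMP_for_hardware code; map them onto whatever your training config actually exposes.

```python
from typing import List


def blend_reward(task_r: float, style_r: float, style_weight: float) -> float:
    """Linearly blend task and style reward (assumed formulation).

    style_weight = 0.0 uses only the task reward,
    style_weight = 1.0 uses only the style reward.
    """
    if not 0.0 <= style_weight <= 1.0:
        raise ValueError("style_weight must be in [0, 1]")
    return (1.0 - style_weight) * task_r + style_weight * style_r


def style_weight_candidates(max_weight: float, num_runs: int) -> List[float]:
    """Evenly spaced style weights from 0 up to max_weight, one per training run."""
    if num_runs < 2:
        raise ValueError("need at least two runs to form a sweep")
    step = max_weight / (num_runs - 1)
    # Round to avoid floating-point noise such as 0.6000000000000001.
    return [round(i * step, 6) for i in range(num_runs)]


if __name__ == "__main__":
    # Hypothetical sweep: start with no style reward and raise it run by run.
    for w in style_weight_candidates(max_weight=0.8, num_runs=5):
        # Placeholder per-step rewards, only to show how the blend shifts.
        print(w, blend_reward(task_r=1.0, style_r=0.5, style_weight=w))
```

Each weight would correspond to a separate training run. Comparing terrain success and velocity-tracking error across the runs should show the point where a larger style weight starts to hurt robustness.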