AMP_for_hardware
AMP_for_hardware copied to clipboard
How to balance task_reward and style_reward
I want to use amp training to walk on complex terrain. Have you tried it before? What parameters do you think have a significant impact on terrain adaptability
I want to use amp training to walk on complex terrain. Have you tried it before? What parameters do you think have a significant impact on terrain adaptability
Do you have any ideas? I am also trying to train walking//running on terrain. I wonder the composition of the task reward and the balance between task reward and style reward.
I want to use amp training to walk on complex terrain. Have you tried it before? What parameters do you think have a significant impact on terrain adaptability
Do you have any ideas? I am also trying to train walking//running on terrain. I wonder the composition of the task reward and the balance between task reward and style reward.
This balance is difficult to quantify, and I believe it is related to many factors. Therefore, during training, I continuously increased the style reward from scratch, and made choices through multiple experimental results. I also found that excessive style rewards can affect the robot's terrain robustness and speed following tasks .