unitree_rl_gym
Why is the policy implemented as a multilayer perceptron (MLP) with only one hidden layer?
class G1RoughCfgPPO( LeggedRobotCfgPPO ):
    class policy:
        init_noise_std = 0.8
        actor_hidden_dims = [32]
        critic_hidden_dims = [32]
        activation = 'elu' # can be elu, relu, selu, crelu, lrelu, tanh, sigmoid
        # only for 'ActorCriticRecurrent':
        rnn_type = 'lstm'
        rnn_hidden_size = 64
        rnn_num_layers = 1
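For context, here is a minimal sketch (not the repo's exact code) of how a hidden-dims list like [32] typically turns into a one-hidden-layer MLP in rsl_rl-style actor-critic code: each entry in the list adds one Linear + activation pair, so a single entry gives a single hidden layer. The observation/action sizes below are placeholders, not the real G1 values.

    # Sketch only: assumes an rsl_rl-style construction where each entry of
    # hidden_dims produces one Linear + activation block.
    import torch.nn as nn

    def build_mlp(num_obs, num_actions, hidden_dims, activation=nn.ELU()):
        layers = []
        in_dim = num_obs
        for dim in hidden_dims:          # one entry -> one hidden layer
            layers += [nn.Linear(in_dim, dim), activation]
            in_dim = dim
        layers.append(nn.Linear(in_dim, num_actions))
        return nn.Sequential(*layers)

    # With actor_hidden_dims = [32] the actor is:
    # obs -> Linear(num_obs, 32) -> ELU -> Linear(32, num_actions)
    actor = build_mlp(num_obs=48, num_actions=12, hidden_dims=[32])  # placeholder sizes
    print(actor)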
I have the same question.
Maybe just provide a sample example.