rand_param_envs
rand_param_envs copied to clipboard
Regarding the Walker2d-Randparams Environment
Thank you for your inspirational great work and open sourcing the code. I have a question regarding the walker random environment. The walker can not walk when the reward is 800, this is the case for all the algorithms that use this environment. When doing a comparative analysis are we supposed to only compare the reward or is the reward scaled for this environment?
Have you observed a walking behavior when using the algorithms that use this environment?