Metaworld
Metaworld copied to clipboard
Reconfirm the environment & task & seed & Successs Rate
@avnishn
Noticing that there are lots of issues about the environment & task, I am sorry to open this type of question again. Reading almost of them, I want to confirm whether my understanding is correct.
-
In my understanding, in meta-RL settings, such as ML10 and ML45. You set 10 or 45 meta-training environments and 5 meta-test environments, each with 50 tasks. Is it True?
-
In the Figure 4-8, "seed (N) = 10" is also used. What's is the seed? Does it mean that in one environment, like reach-v2, you set 50 different variants/tasks (with different initial object and goal positions) and each variant/task you repeat 10 times?
-
Based on the above understanding of seeds, I think that 500 (50 tasks * 10 seeds) total epsisodes/trajecories/rollouts are reached for each environment. Therefore the Success Rate = # success episidoes/ 500. Is it true?