gym icon indicating copy to clipboard operation
gym copied to clipboard

[Proposal] Initial State Tracking - Bipedal Walker Hardcore

Open arthur-plautz opened this issue 1 year ago • 0 comments

Proposal

Create methods that are able to retrieve and set, the initial states that uniquely represents the terrain created for Bipedal Walker Hardcore.

Motivation

I'm currently working on a research on curriculum learning, based on a paper that used the bipedal walker hardcore problem to demonstrate that a selection on the initial conditions of the experiment could improve the overall performance. My current work implies extracting the states used to create the terrain, so that I can use it to automatically select this initial states along the experiment, and also set the states to generate a specific terrain. I was working on some changes in the bipedal environment locally, but it occurred to me that this could benefit other people that may have the same issue.

Pitch

The idea is to have a parameter in the reset method that allows the overwriting of the states used to generate the terrain, and also the persistence of those states, available through an attribute, similar to the terrain attribute.

Alternatives

We may have other ideas of how to do it, but I think the solution would involve a method to set the states and an attribute to access them. Feel free to make your suggestions :smile:

Additional context

The paper: Automated curriculum learning for embodied agents a neuroevolutionary approach.

Checklist

  • [x] I have checked that there is no similar issue in the repo (required)

arthur-plautz avatar Oct 16 '23 23:10 arthur-plautz