Meta-RL-TwoStep-Task icon indicating copy to clipboard operation
Meta-RL-TwoStep-Task copied to clipboard

A question about TwoStepTask.step() function

Open fangzefunny opened this issue 5 years ago • 0 comments

Thanks for providing such a readable code.

If I am correct, you were compressing stage1 and stage2 into 1 trial. In this case, I think the next_state of this trial should be S1, because each trial should begin with stage1. However, in my simple simulation, the non-ep environment never returns S1 [ 1, 0,0]: image

I wonder if it is a special design for the "incremental" case? I am not sure if I am on the same page with you. Can you offer a brief definition of "incremental", "episodic" in your README?

Thank you very much!!

fangzefunny avatar Sep 09 '20 04:09 fangzefunny