DecisionTransformerInterpretability icon indicating copy to clipboard operation
DecisionTransformerInterpretability copied to clipboard

Write a Probe Environment that tests a models ability to look at previous observations

Open jbloomAus opened this issue 1 year ago • 1 comments

We currently have 5 probe environments for single timestep models and I'd like a prob environment to test if a model can learn:

  1. to take the correct action as a function of a previous observation
  2. to test that the model doesn't learn the correct action if it's outside the recurrence range or something (more speculative)

jbloomAus avatar Apr 04 '23 00:04 jbloomAus