DecisionTransformerInterpretability
Write a Probe Environment that tests a model's ability to look at previous observations
We currently have 5 probe environments for single-timestep models, and I'd like a probe environment to test:
- whether a model can learn to take the correct action as a function of a previous observation
- whether the model fails to learn the correct action when the relevant observation is outside its recurrence range (more speculative)
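
As a rough starting point, here is a minimal sketch of what such a probe could look like. It is plain Python with a gym-like `reset`/`step` interface and does not use the repo's existing probe environment classes. The names `DelayedCueProbeEnv`, `delay`, and `run_episode` are hypothetical, as is the choice of encoding the cue as +1/-1 and blank steps as 0. The cue is only visible in the first observation, and only the action taken `delay` steps later is rewarded, so a model that only sees the current observation cannot do better than chance.

```python
import random


class DelayedCueProbeEnv:
    """Hypothetical probe: the cue is shown at t=0 only; the action at t=delay is rewarded."""

    def __init__(self, delay=1, seed=None):
        if delay < 1:
            raise ValueError("delay must be >= 1 so the cue is never visible when it matters")
        self.delay = delay
        self.rng = random.Random(seed)
        self.cue = None
        self.t = 0

    def reset(self):
        # Sample a binary cue and expose it only in the first observation.
        self.cue = self.rng.choice([0, 1])
        self.t = 0
        return 1.0 if self.cue == 1 else -1.0

    def step(self, action):
        # Only the action taken at timestep `delay` affects the reward.
        is_final = self.t == self.delay
        reward = 0.0
        if is_final:
            reward = 1.0 if action == self.cue else -1.0
        self.t += 1
        # Every observation after the first is blank (0.0).
        return 0.0, reward, is_final, {}


def run_episode(env, policy):
    """Roll out one episode; `policy` maps the list of observations so far to an action."""
    observations = [env.reset()]
    total_reward, done = 0.0, False
    while not done:
        obs, reward, done, _ = env.step(policy(observations))
        observations.append(obs)
        total_reward += reward
    return total_reward


def memory_policy(observations):
    # Uses the first observation, so it should always be rewarded.
    return 1 if observations[0] > 0 else 0


def memoryless_policy(observations):
    # Uses only the latest observation, which is blank at decision time.
    return 1 if observations[-1] > 0 else 0
```

For the second, more speculative test, `delay` could be set larger than the number of timesteps the model can attend to, in which case the trained model would be expected to perform like `memoryless_policy` rather than `memory_policy`.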