DecisionTransformerInterpretability
DecisionTransformerInterpretability copied to clipboard
Store checkpoints on wandb during offline training
Closes #39
Similar to how storing of PPO checkpoints works, the number of checkpoints can be set using a command line argument.