DecisionTransformerInterpretability
DecisionTransformerInterpretability copied to clipboard
Add static interpretability visualizations to wandb dashboard.
Add static interpretability visualizations to wandb dashboard. Seems like a cool idea I just had.
Some stuff: QK/OV circuit viz, different attribution embeddings, time embedding, L2 Norms of different components.