elk
elk copied to clipboard
Plotting for individual predictions across depth
Given a set of reporters for each layer of a model and a fixed input, we can extract the model's "belief" at each layer and see how it evolves over time, similar to how the tuned lens works.
This is low-ish priority, but I think this should be done in time for the paper at least.