elk Plotting for individual predictions across depth

Open norabelrose opened this issue 2 years ago • 0 comments

Given a set of reporters for each layer of a model and a fixed input, we can extract the model's "belief" at each layer and see how it evolves across depth, similar to how the tuned lens works.

This is low-ish priority, but I think this should be done in time for the paper at least.
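
A rough sketch of what this could look like, not tied to the actual elk reporter classes. It assumes each reporter is a plain linear probe given as a `(weights, bias)` pair and that the hidden state for the fixed input has already been extracted at every layer. The names `belief_trajectory` and `plot_trajectory` are hypothetical, and matplotlib is assumed to be installed for the plotting step.

```python
import math
from typing import List, Sequence, Tuple

# Hypothetical stand-in for a per-layer reporter: a linear probe (weights, bias).
Reporter = Tuple[Sequence[float], float]


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def belief_trajectory(
    hiddens: Sequence[Sequence[float]],
    reporters: Sequence[Reporter],
) -> List[float]:
    """Return one credence per layer for a single fixed input.

    hiddens[i] is the hidden state at layer i; reporters[i] is the probe
    trained on layer i. Both are assumptions about the data layout.
    """
    if len(hiddens) != len(reporters):
        raise ValueError("need exactly one reporter per layer")
    probs = []
    for hidden, (weights, bias) in zip(hiddens, reporters):
        logit = sum(w * h for w, h in zip(weights, hidden)) + bias
        probs.append(sigmoid(logit))
    return probs


def plot_trajectory(probs: Sequence[float], path: str = "belief_by_layer.png") -> None:
    """Save a line plot of credence against layer index."""
    # Imported lazily so the trajectory code works without matplotlib.
    import matplotlib
    matplotlib.use("Agg")  # headless backend, writes straight to a file
    import matplotlib.pyplot as plt

    fig, ax = plt.subplots()
    ax.plot(range(len(probs)), probs, marker="o")
    ax.axhline(0.5, linestyle="--", color="gray")  # chance level
    ax.set_xlabel("Layer")
    ax.set_ylabel("Reporter credence")
    ax.set_ylim(0.0, 1.0)
    fig.savefig(path)
    plt.close(fig)
```

Calling `plot_trajectory(belief_trajectory(hiddens, reporters))` would then write a single figure for the chosen input, with layer index on the x-axis and the reporter's credence on the y-axis.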

Feb 15 '23 06:02 norabelrose
