automated-interpretability
Make a demo script for getting neuron activations using TransformerLens?
(No plans to do this right now, but it could be useful.)
I added a demo notebook for getting the neuron activations using NeuroX.