Joseph Bloom

41 issues by Joseph Bloom

**Describe the bug** We added gated MLPs when we added LLaMA support (https://github.com/neelnanda-io/TransformerLens/commit/3d03ca5081ff0b7a920ffe7830e2c3da0e6e9d07); however, we didn't update the docs or add tests specifically for the GatedMLP component. It's on me...

**Describe the bug** Attribution patching demo: **Code example** Please try to provide a minimal example to reproduce the bug. Error messages and stack traces are also helpful. (see patching section...

bug
help wanted

- [x] 1. Dot product between each output action.
- [x] 2. Dot product between each input action.
- [ ] 3. Dot product between each time embedding.
- [x]...

Basic concept is that we can sample from which heads we actually compute randomly in order to see which matter. Shapley values are usually computed over all subsets of heads....
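The sampling idea can be sketched as follows. This is a minimal illustration, not code from the issue: it assumes a permutation-sampling (Monte Carlo) estimator, and the names `sampled_shapley`, `toy_metric`, and `weights` are hypothetical. In practice `value_fn` would run the model with only the given heads active and return a scalar metric; here a toy additive metric stands in for it.

```python
import random
from typing import Callable, Dict, FrozenSet, Hashable, Sequence


def sampled_shapley(
    players: Sequence[Hashable],
    value_fn: Callable[[FrozenSet[Hashable]], float],
    n_permutations: int = 200,
    seed: int = 0,
) -> Dict[Hashable, float]:
    """Estimate Shapley values by averaging marginal contributions
    over randomly sampled orderings instead of enumerating all subsets."""
    rng = random.Random(seed)
    totals = {p: 0.0 for p in players}
    for _ in range(n_permutations):
        order = list(players)
        rng.shuffle(order)
        active = set()
        prev = value_fn(frozenset(active))
        for p in order:
            # Marginal contribution of p given the heads added before it.
            active.add(p)
            cur = value_fn(frozenset(active))
            totals[p] += cur - prev
            prev = cur
    return {p: total / n_permutations for p, total in totals.items()}


# Hypothetical stand-in for "run the model with only these heads active".
# Heads are (layer, head) pairs; the metric is additive, so the true
# Shapley value of each head equals its weight.
weights = {(0, 0): 1.0, (0, 1): 0.0, (1, 0): 2.5}


def toy_metric(active_heads: FrozenSet[Hashable]) -> float:
    return sum(weights[h] for h in active_heads)


estimates = sampled_shapley(list(weights), toy_metric, n_permutations=50)
```

Each sampled ordering costs one evaluation per head, so the total cost is `n_permutations * len(players)` evaluations rather than one per subset. With a real model the estimates are noisy, and more permutations reduce the variance.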

QK
- [ ] State to Action.

OV
- [ ] Fix head selection (default to all)
- [ ] Find a way to automatically find axes in the OV circuit...

- [ ] Use t-lens naming scheme
- [ ] Enable arbitrary combinations of heads and MLPs

- [x] Psychological eval
- [ ] Activation Patching for instruction and RTG. -> try to explain
- [ ] Work out how to tackle targets (patching same object multiple...

https://www.lesswrong.com/posts/AcKRB8wDpdaN6v6ru/interpreting-gpt-the-logit-lens
https://colab.research.google.com/drive/1MjdfK2srcerLrAJDRaJQKO0sUiZ-hQtA?usp=sharing
pip install git+https://github.com/finetuneanon/transformers/@gpt-neo-localattention

## Analysis features

### Static Composition

- [x] Make composition maps
- [x] Replace composition scores with strip plots?
- [ ] Create a meta-composition score. Something that measures total...

enhancement