Arthur Conmy issues

Results 21 issues of


                                            Arthur Conmy

[Bug Report] Fix `n_params` counts

**Describe the bug** The `n_params` counts calculated [here](https://github.com/neelnanda-io/TransformerLens/blob/f5a7d455546a88cfdfb26e781d5bd6447e8243eb/transformer_lens/HookedTransformerConfig.py#L242) are wrong. For example, LLAMA uses SwiGLU so the 2x factor in the [linked code ](https://github.com/neelnanda-io/TransformerLens/blob/f5a7d455546a88cfdfb26e781d5bd6447e8243eb/transformer_lens/HookedTransformerConfig.py#L242) is wrong. Further this just ignores...

bug

Add Llama-2 models

Addressing in https://github.com/neelnanda-io/TransformerLens/pull/352 [x] Implemented Llama-2-7B and Llama-2-13B [ ] Implement Llama-2-70B architecture (add Grouped-Query Attention)

Add automatic notebook generation

We should be able to use the python script here: https://github.com/nojvek/vscode-ipynb-py-converter to i) write notebooks as `.py` files with `#%%` ii) have these automatically converted to `.ipynb` files on push...

enhancement

good first issue

[Proposal] [Low-Priority] Improve LN1's hooks

**Background**: When I implemented `use_split_qkv_input` [here](https://github.com/neelnanda-io/TransformerLens/pull/158), I changed the attention module to take three inputs (to query, key and value). Even when this feature is not enabled, **we compute the...

enhancement

low-priority

[Proposal] Provide better print-outs for current hooks attached

### Proposal Add a feature to provide detailed print-outs of currently attached hooks to a model and a HookPoint. ### Motivation Sometimes I want to look at what hooks I've...

enhancement

`register_hook` documentation needs improvement

## 📚 Documentation The formatting https://pytorch.org/docs/stable/generated/torch.Tensor.register_hook.html of `Tensor.register_hook` is strange, the link after this is broken, and the code example should specify `gradient=` to the backward call, as this is...

Be more efficient with corrupted caching

```python for k in exp.global_cache.corrupted_cache.keys(): print(k, exp.global_cache.corrupted_cache[k].shape, k in exp.global_cache.online_cache) ``` returns lots of unnecessary things: ``` blocks.0.ln1.hook_scale torch.Size([40, 41, 8, 1]) False blocks.0.ln1.hook_normalized torch.Size([40, 41, 8, 512]) False ......

Upgrade to ARENA's IOIDataset

[This](https://github.com/callummcdougall/ARENA_2.0/blob/4cda66b64b48dbabdc6b0bd6f5d7a86eea375507/chapter1_transformers/exercises/part3_indirect_object_identification/ioi_dataset.py#L532) file has several nicer methods for easy IOI usage

Add head only version

Will close https://github.com/ArthurConmy/Automatic-Circuit-Discovery/issues/73 However, relies on a PR to TransformerLens here https://github.com/neelnanda-io/TransformerLens/pull/336 (the poetry is updated in the ACDC PR to install this version of TransformerLens)

Use neato

`neato -s1 -Tpdf -ogreaterthan_mlp.pdf greaterthan_mlp.gv` generates much nicer plots: [greaterthan_mlp.pdf](https://github.com/ArthurConmy/Automatic-Circuit-Discovery/files/11782518/greaterthan_mlp.pdf)