Neel Nanda issues

Results 15 issues of


                                            Neel Nanda

[Proposal] Include num_kv_heads in the Model Properties Table in the docs

### Proposal Include num_kv_heads in the Model Properties Table in the docs ### Motivation A fair amount of models now use multi-query attention (ie single key and single value) or...

Add tests + better docs to ActivationCache

Add tests that the methods in the ActivationCache class work correctly. Go through the documentation and clarify things that are unclear (this is hard for me to do, so even...

documentation

help wanted

good first issue

[Proposal] Add Support for Yi-6B and Yi-34B

### Proposal Yi-6B and Yi-34B are new models that make a plausible claim to be the current best open source models, beating Falcon 180B and LLaMA-2 70B on MMLU. It'd...

enhancement

help wanted

Better docs for model properties

Make [this table](https://github.com/neelnanda-io/TransformerLens/blob/main/easy_transformer/model_properties_table.md) better and cover key info for model architecture - whether it uses parallel attn & MLPs, and what positional embedding it is. Add text at the bottom...

documentation

help wanted

good first issue

[Proposal] Have ActivationCache.get_full_resid_decomposition support passing in a vector/tensor to project onto

### Proposal Allow ActivationCache.get_full_resid_decomposition to receive a `project_output_onto` tensor that is either a [d_model] tensor or [d_model, num_outputs] tensor, such we multiply the output by that. Internally, rather than taking...

Add mixed precision inference incl loading

Add the option to load models in bfloat16 and float16. Esp important for large models like GPT-J and GPT-NeoX. Ideally, load from HuggingFace in this low precision, do weight processing...

enhancement

help wanted

good first issue

Add wrapper integrating HookedTransformer with Google's Learning Interpretability Tool (LIT)

Google have a very cool-looking tool for (mostly non-MI) interpretability of language models, called LIT. It seems designed to be framework agnostic, and to be able to take a wrapper...

enhancement

help wanted

good first issue

demo

Add a demo of direct path patching

Direct path patching is like activation patching, but rather than patching in the output of component A, it acts on *pairs* of components A and B (in a layer after...

help wanted

demo

Add tests + better docs for FactoredMatrix

Add tests that the FactoredMatrix class works (essentially that each of its methods correctly mimics the result for the actual matrix product). Go through the documentation and clarify things that...

documentation

help wanted

good first issue

Add tests + better docs for tokenization methods

Add tests that the tokenization methods work (to_tokens, to_string, to_str_tokens, get_token_position) Go through the documentation and clarify things that are unclear (this is hard for me to do, so even...

documentation

help wanted

good first issue