DecisionTransformerInterpretability Reverse Logit Lense

Reverse Logit Lense

Open jbloomAus opened this issue 1 year ago • 0 comments

https://www.lesswrong.com/posts/AcKRB8wDpdaN6v6ru/interpreting-gpt-the-logit-lens

https://colab.research.google.com/drive/1MjdfK2srcerLrAJDRaJQKO0sUiZ-hQtA?usp=sharing

pip install git+https://github.com/finetuneanon/transformers/@gpt-neo-localattention

May 22 '23 15:05 jbloomAus