activation-patching topic

List activation-patching repositories

pyvene

627
Stars
61
Forks
Watchers

Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions