automated-interpretability topic
List
automated-interpretability repositories
automated-explanations
36
Stars
6
Forks
Watchers
Generating and validating natural-language explanations.