elk-generalization
elk-generalization copied to clipboard

Published 20 hours ago •

→

Metadata

Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from easy questions to hard

Reame
Issues

Results 1 elk-generalization issues

Sort by recently updated

Support logging of accuracy for non-constant answer choices

Huggingface evaluation doesn't allow passing of the answer choices to the evaluation metric function, so we are currently asserting that the answer choices for all examples are the same.

AlexTMallen

About

Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from easy questions to hard

Stars

Forks

Watchers

Owner

EleutherAI

← Metadata

Stars

Forks

Watchers

Owner

EleutherAI

Metadata

Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from easy questions to hard

Back

elk-generalization elk-generalization copied to clipboard

Metadata

Support logging of accuracy for non-constant answer choices

← Metadata

Owner

Metadata

elk-generalization
elk-generalization copied to clipboard