langtest
langtest copied to clipboard
Implement Accuracy Drop for Robustness and Bias Tests
This implementation involves comparing the ground truth vs. expected result and the ground truth vs. actual result, where the actual result is derived from a perturbed version of the original text. The goal is to measure the accuracy drop that occurs due to robustness and bias. For more information, refer to the LangTest robustness tests.