weave feat(weave): LangFair Integration

Description

This PR creates two scorer classes; CounterfactualScorer & ToxicityScorer. These scorers compute counterfactual and toxicity metrics supported by the LangFair.

Addresses: https://github.com/wandb/weave/issues/4039

What does the PR do? Include a concise description of the PR contents.

This PR includes two Scorer classes, unit tests for these classes, & example notebook to illustrate a working implementation of these scorer (note: we will remove/change location as per reviewers suggestion).

Counterfactual Scorer: Input an LLM propmts to this class, & this class will identify protected words (gender or race related) in the prompts/questions, create counterfactual prompts, generate counterfactual responses, and compute metric values supported by LangFair ('Cosine', 'RougeL', 'Bleu', 'Sentiment Bias'). Toxicity Scorer: This class gives an measure of toxicity present in the LLM response using a classifier supported by LangFair.

Testing

How was this PR tested?

The PR was tested using the unit tests that can be find in following files

tests/scorers/test_counterfactual_scorer.py tests/scorers/test_toxicity_scorer.py

The test cassettes are available in following directory (tests/integration/langfair/cassettes/langfair_test). These cassettes are generated using tests in "tests/integration/langfair/langfair_test.py" file, which contains all the unit tests from test_counterfactual_scorer.py and test_toxicity_scorer.py.

Apr 04 '25 19:04 mohitcek

This PR requires manual approval from a wandb user to run all CI checks.

To see the current diff, click here.

To approve CI for this PR as of this commit, comment:

/approve 4394f204efafa3e45180426805f66a7d8b9256c3

Apr 04 '25 19:04 circle-job-mirror[bot]

Preview this PR with FeatureBee: https://beta.wandb.ai/?betaVersion=04764985904517d17a73e6c3aaa2a03d23737a12

Apr 04 '25 19:04 circle-job-mirror[bot]

Initial comments by abraham-leal

Hi guys! Thank you so much for the PR! There are a few procedural things to take care of here:

[x] 1. Please generate test casettes with our testing frame work, more information here: https://github.com/wandb/weave/blob/master/CONTRIBUTING.md#testing and here: https://github.com/wandb/weave/tree/365b0b45e68a92ec8abb76425637c6a0ca9ffcd5/weave/integrations#readme
[x] 2. Please ensure to use nox to lint
[x] 3. Please add any dependencies in the pyproj under optional in the scorer section: https://github.com/wandb/weave/blob/master/pyproject.toml
[x] 4. Instead of a notebook, please add an integration description under local scorers in our docs, a sample PR of another scorer doing this is here: https://github.com/wandb/weave/pull/3698/files
[x] 5. Please change the scorer name to indicate its langfair provenance, something like LangfairToxicityScorer or something like that

After those changes, it will be a much more complete PR and we can test it locally and in our suite :) THANK YOU SO MUCH AGAIN.

Apr 04 '25 19:04 mohitcek

Review the following changes in direct dependencies. Learn more about Socket for GitHub.

Diff	Package	Supply Chain Security	Vulnerability	Quality	Maintenance	License
	pypi/langfair@0.6.7

View full report

Apr 10 '25 16:04 socket-security[bot]

[!WARNING] Review the following alerts detected in dependencies.

According to your organization's Security Policy, it is recommended to resolve "Warn" alerts. Learn more about Socket for GitHub.

Action	Severity	Alert (click "▶" to expand/collapse)
Warn		`pypi/[email protected]` has a License Policy Violation. License: License :: Other/Proprietary License (langfair-0.6.7/PKG-INFO) From: pyproject.toml → `pypi/[email protected]` ℹ Read more on: This package \| This alert \| What is a license policy violation? Next steps: Take a moment to review the security alert above. Review the linked package source code to understand the potential risk. Ensure the package is not malicious before proceeding. If you're unsure how to proceed, reach out to your security team or ask the Socket team for help at `[email protected]`. Suggestion: Find a package that does not violate your license policy or adjust your policy to allow this package's license. Mark the package as acceptable risk. To ignore this alert only in this pull request, reply with the comment `@SocketSecurity ignore pypi/[email protected]`. You can also ignore all packages with `@SocketSecurity ignore-all`. To ignore an alert for all future pull requests, use Socket's Dashboard to change the triage state of this alert.
Warn		`pypi/[email protected]` has a License Policy Violation. License: License :: Other/Proprietary License (langfair-0.6.7.dist-info/METADATA) From: `?` → `pypi/[email protected]` ℹ Read more on: This package \| This alert \| What is a license policy violation? Next steps: Take a moment to review the security alert above. Review the linked package source code to understand the potential risk. Ensure the package is not malicious before proceeding. If you're unsure how to proceed, reach out to your security team or ask the Socket team for help at `[email protected]`. Suggestion: Find a package that does not violate your license policy or adjust your policy to allow this package's license. Mark the package as acceptable risk. To ignore this alert only in this pull request, reply with the comment `@SocketSecurity ignore pypi/[email protected]`. You can also ignore all packages with `@SocketSecurity ignore-all`. To ignore an alert for all future pull requests, use Socket's Dashboard to change the triage state of this alert.

View full report

Apr 10 '25 16:04 socket-security[bot]

@abraham-leal should be ready for review now!

Apr 15 '25 16:04 dylanbouchard

Hi @abraham-leal, I have updated the PR as per your comments. Also, the base branch is set to master branch (wandb:master), I believe that is correct, right?

Apr 21 '25 16:04 mohitcek

hey @abraham-leal - let us know if you need anything else from us on this one. Thank you!

May 01 '25 13:05 dylanbouchard

Hey @mohitcek can you provide a link to a public workspace to allow us to see the integration in action please?

Jun 03 '25 19:06 abraham-leal

Hey @mohitcek can you provide a link to a public workspace to allow us to see the integration in action please?

Hi @abraham-leal, I appreciate any help on this; perhaps more details or an example would be helpful.

Jun 03 '25 20:06 mohitcek

Hey @mohitcek can you provide a link to a public workspace to allow us to see the integration in action please?

Hi @abraham-leal , here are links to public workspace

Toxicity assessment: https://wandb.ai/mohitcek-cvs-health/LangFair%20Toxicity%20Score/weave/traces?view=traces_default
Counterfactual assessment: https://wandb.ai/mohitcek-cvs-health/LangFair%20Counterfactual%20Score/weave/traces?view=traces_default

Jul 31 '25 18:07 mohitcek

@abraham-leal @tssweeney let us know if you have any questions or feedback for us. Appreciate your review!

Aug 11 '25 14:08 dylanbouchard

@abraham-leal any updates on this PR? Please let us know.

Sep 07 '25 15:09 dylanbouchard