SDMetrics issues

Should fairness metrics (aka de-biasing metrics) assume you're doing data augmentation?

### Environment details * SDMetrics version: 0.21.0 ### Description We have an upcoming fairness metric called [EqualizedOddsImprovement](https://docs.sdv.dev/sdmetrics/metrics/privacy-and-fairness-metrics/equalizedoddsimprovement). This is meant to indicate whether the synthetic data is improving the fairness...

npatki

question

When data augmentation is recommended, should the metrics do the augmentation internally? Or should the user do it beforehand?

### Environment details * SDMetrics version: 0.21.0 ### Background In certain metrics like [BinaryClassifierPrecisionEfficacy](https://docs.sdv.dev/sdmetrics/metrics/ml-augmentation-metrics/binaryclassifierprecisionefficacy) and [EqualizedOddsImprovement](https://docs.sdv.dev/sdmetrics/metrics/privacy-and-fairness-metrics/equalizedoddsimprovement), the user is generally interested in _augmenting_ the real data with synthetic data. So...

npatki

question

Allow me to control randomization when using the `DCRBaselineProtection` metric

### Problem Description The `DCRBaselineProtection` metric measures the privacy of synthetic data by comparing it against random data. The random data is created by uniformly sampling in the real data's...

npatki

feature request
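
To illustrate what controlling the randomization could mean for the `DCRBaselineProtection` request above, here is a minimal sketch of seeded uniform sampling. It is not SDMetrics code: the function name `sample_uniform_baseline` and the `seed` parameter are hypothetical, and because the issue preview is cut off before it says what is sampled, drawing between a column's minimum and maximum is purely an assumption.

```python
import random


def sample_uniform_baseline(real_values, num_rows, seed=None):
    """Draw num_rows values uniformly between the min and max of real_values.

    Hypothetical helper: the min/max range and the seed argument are
    assumptions made for illustration, not part of the SDMetrics API.
    """
    rng = random.Random(seed)  # a private generator, so global state is untouched
    low, high = min(real_values), max(real_values)
    return [rng.uniform(low, high) for _ in range(num_rows)]


# Made-up numeric column standing in for the real data
real_column = [3.0, 7.5, 1.2, 9.9]

# The same seed gives the same random baseline on every call
first = sample_uniform_baseline(real_column, num_rows=5, seed=42)
second = sample_uniform_baseline(real_column, num_rows=5, seed=42)
```

With a fixed seed the baseline is reproducible, so any score computed against it would not vary from run to run for that reason alone.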

Update BNLikelihood metrics to use pomegranate 1.x

### Problem Description While #701 updated the installation instructions needed to use `BNLikelihood` and `BNLogLikelihood` to fix the errors users were encountering using 1.x or 0.14.x versions of pomegranate, it...

rwedge

feature request

Investigate performance limits of `DisclosureProtection` metric

### Problem Description Currently, the `DisclosureProtection` metric warns about poor performance when the size of the input data is greater than 50,000 rows. This number was chosen without investigation into...

frances-h

feature request

Investigate options for InterRowMSAS

### Problem Description Right now, the InterRowMSAS metric takes the direct difference between a value in row `n` and row `n+1`. Then, it averages out all these differences. As a...

npatki

feature request

data:sequential
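
The differencing step described in the InterRowMSAS issue above can be sketched in a few lines. This is a plain-Python illustration of that one step, not the metric's actual implementation; the function name is hypothetical, and taking row `n+1` minus row `n` (rather than the reverse) is an assumption.

```python
def mean_inter_row_difference(values):
    """Average the differences between each row n and the following row n+1."""
    if len(values) < 2:
        raise ValueError("need at least two rows to take a difference")
    # Pair every value with its successor and subtract (assumed: next - current)
    diffs = [nxt - cur for cur, nxt in zip(values, values[1:])]
    return sum(diffs) / len(diffs)


# Made-up values for one column of one sequence
sequence = [10.0, 12.0, 15.0, 14.0]
average_difference = mean_inter_row_difference(sequence)
```

Because signed differences are averaged, increases and decreases offset each other, so a sequence that swings up and down can end up with the same average as one that barely moves.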
