SDMetrics issues

`get_column_plot` produces misleading graphs (for uniform-like distributions)

`get_column_plot` produces histograms which take a lot of liberty when representing the data, especially at the edges. The Real data and the matplotlib plot represent the same data (ignore the...

fealho

feature request

Detection metrics score doesn't accurately point out quality or privacy. Should the formula be changed?

### Problem Description The detection metrics for [single table data](https://docs.sdv.dev/sdmetrics/metrics/metrics-in-beta/detection-single-table) and [sequential data](https://docs.sdv.dev/sdmetrics/metrics/metrics-in-beta/detection-sequential) both compute the `AUC (ROC)` and return `1-AUC` as the final score. The score is hard to...

npatki

question

Evaluation metrics for synthetic generated PII informations.

1

### Problem Description What are the different metrics I can use to check quality of PII information produced? report.get_diagnostics() checks the coverage and range of numerical/categorical data. But is there...

yash-rathore

feature request

New column-pairs sdmetrics between categorical and numerical data

The goal is to propose a new column-pair metric between one numerical and one categorical column. ### Current behavior The Quality report has to discretize the numerical column and do...

R-Palazzo

feature request

feature:metrics

Re-implement the NewRowSynthesis

The goal here is to make the `NewRowSynthesis` metric more fault-tolerant and make it faster/more efficient to run.

R-Palazzo

internal

feature:metrics

Make it easier to apply a metric to multiple columns/tables

### Problem Description Some metrics such as [StatisticSimilarity](https://docs.sdv.dev/sdmetrics/metrics/metrics-glossary/statisticsimilarity) are defined on a per-column level. If I want to apply it to several columns of several tables at once, I have...

npatki

feature request

sklearn throws ValueError exception

2

### Problem Description I am working with a home-grown synthesizer that is able to synthesize relatively rare categorical values (i.e. one that occurs maybe 3 or 4 times in a...

yoid2000

bug

under discussion

I want to be able to modify synthetic_sample_size in the diagnostic report (both single-table and multi-table).

4

### Problem Description I cannot override the synthetic sample size used in the diagnostic report for the NewRowSynthesis metric, for both single-table and multiple-table diagnostic reports. Currently, I am doing...

echatzikyriakidis

feature request

under discussion

feature:reports

`LSTMDetection` metric crashes when there are multiple context columns

1

### Environment Details Please indicate the following details about the environment in which you found the bug: * SDMetrics version: * Python version: * Operating System: ### Error Description metadata1={'fields':...

Sanchita333

bug

data:sequential

feature:metrics

Semantic error in transforming datetime columns using HyperTransformer

The snippet below should be something like: `data[field] = pd.Series(integers, data.index)`. https://github.com/sdv-dev/SDMetrics/blob/c9967494126e6273d3d97ebf8c1b045861a3f126/sdmetrics/utils.py#L199 As currently implemented, the transformed data will incorrectly map the values to the wrong data if the index...

avsolatorio

bug

new

SDMetrics
SDMetrics copied to clipboard

Metadata

`get_column_plot` produces misleading graphs (for uniform-like distributions)

Detection metrics score doesn't accurately point out quality or privacy. Should the formula be changed?

Evaluation metrics for synthetic generated PII informations.

New column-pairs sdmetrics between categorical and numerical data

Re-implement the NewRowSynthesis

Make it easier to apply a metric to multiple columns/tables

sklearn throws ValueError exception

I want to be able to modify synthetic_sample_size in the diagnostic report (both single-table and multi-table).

`LSTMDetection` metric crashes when there are multiple context columns

Semantic error in transforming datetime columns using HyperTransformer

← Metadata

Owner

Metadata

SDMetrics SDMetrics copied to clipboard

Metadata

← Metadata

Owner

Metadata

SDMetrics
SDMetrics copied to clipboard