SDMetrics icon indicating copy to clipboard operation
SDMetrics copied to clipboard

Metrics to evaluate quality and efficacy of synthetic datasets.

Results 104 SDMetrics issues
Sort by recently updated
recently updated
newest added

`get_column_plot` produces histograms which take a lot of liberty when representing the data, especially at the edges. The Real data and the matplotlib plot represent the same data (ignore the...

feature request

### Problem Description The detection metrics for [single table data](https://docs.sdv.dev/sdmetrics/metrics/metrics-in-beta/detection-single-table) and [sequential data](https://docs.sdv.dev/sdmetrics/metrics/metrics-in-beta/detection-sequential) both compute the `AUC (ROC)` and return `1-AUC` as the final score. The score is hard to...

question

### Problem Description What are the different metrics I can use to check quality of PII information produced? report.get_diagnostics() checks the coverage and range of numerical/categorical data. But is there...

feature request

The goal is to propose a new column-pair metric between one numerical and one categorical column. ### Current behavior The Quality report has to discretize the numerical column and do...

feature request
feature:metrics

The goal here is to make the `NewRowSynthesis` metric more fault-tolerant and make it faster/more efficient to run.

internal
feature:metrics

### Problem Description Some metrics such as [StatisticSimilarity](https://docs.sdv.dev/sdmetrics/metrics/metrics-glossary/statisticsimilarity) are defined on a per-column level. If I want to apply it to several columns of several tables at once, I have...

feature request

### Problem Description I am working with a home-grown synthesizer that is able to synthesize relatively rare categorical values (i.e. one that occurs maybe 3 or 4 times in a...

bug
under discussion

### Problem Description I cannot override the synthetic sample size used in the diagnostic report for the NewRowSynthesis metric, for both single-table and multiple-table diagnostic reports. Currently, I am doing...

feature request
under discussion
feature:reports

### Environment Details Please indicate the following details about the environment in which you found the bug: * SDMetrics version: * Python version: * Operating System: ### Error Description metadata1={'fields':...

bug
data:sequential
feature:metrics

The snippet below should be something like: `data[field] = pd.Series(integers, data.index)`. https://github.com/sdv-dev/SDMetrics/blob/c9967494126e6273d3d97ebf8c1b045861a3f126/sdmetrics/utils.py#L199 As currently implemented, the transformed data will incorrectly map the values to the wrong data if the index...

bug
new