data-validation icon indicating copy to clipboard operation
data-validation copied to clipboard

Library for exploring and validating machine learning data

Results 55 data-validation issues
Sort by recently updated
recently updated
newest added

Towards the goal of adding support for computing statistics over structured data (e.g., arbitrary protocol buffers, parquet data), we will populate [`path`](https://github.com/tensorflow/metadata/blob/master/tensorflow_metadata/proto/v0/statistics.proto#L105) for each feature instead of [`name`](https://github.com/tensorflow/metadata/blob/master/tensorflow_metadata/proto/v0/statistics.proto#L102) in the...

Announcement

Does TFDV support reading tf.SequenceExample from TFRecords, inferring a schema over them and compute statistics from them?

stat:awaiting tensorflower
type:feature

stat:awaiting tensorflower
type:feature

Hi Paul, As discussed on SO please find below my feedback on what I think would be nice additions to this already great library: 1) possibility to merge generated stats:...

stat:awaiting tensorflower
type:feature

Hi, Problem: - I am unable to use tfdv with poetry due to dependencies not being resolved. For simplicity and debugging purposes, below are the steps to recreate the issues:...

stat:awaiting response
type:bug

The package joblib from version 0 and before 1.2.0 are vulnerable to Arbitrary Code Execution via the pre_dispatch flag in Parallel() class due to the eval() statement. My PR: [https://github.com/tensorflow/data-validation/pull/225...

stat:awaiting response
type:bug

The package joblib from 0 and before 1.2.0 are vulnerable to Arbitrary Code Execution via the pre_dispatch flag in Parallel() class due to the eval() statement.

Hello, I am trying to display the schema and statistics in a dashboard/UI using streamlit. I have incorporated the code in a .py file. But the visuals are coming in...

stat:awaiting response
type:support

I've been looking through the detectable [anomalies](https://github.com/tensorflow/metadata/blob/master/tensorflow_metadata/proto/v0/anomalies.proto) and realized that I don't think there's a way to accomplish what I'd like to accomplish, which is enforce a distribution constraint on...

stat:awaiting tensorflower
type:feature

Fixed Install NumPy instructions link to "https://numpy.org/install/"