data-validation icon indicating copy to clipboard operation
data-validation copied to clipboard

Library for exploring and validating machine learning data

Results 55 data-validation issues
Sort by recently updated
recently updated
newest added

why does below code shows mismatch in distribution. Does Jensen-Shannon divergence sensitive to number of samples? def show_anomalies(train_data, test_data): train_stats = tfdv.generate_statistics_from_dataframe(train_data) test_stats = tfdv.generate_statistics_from_dataframe(test_data) schema = tfdv.infer_schema(statistics=train_stats) for f...

stat:awaiting response
type:support

Hi, After searching online whether tfdv could be used to validate data that contains text. For instance, for a dataset with sentences that have to be mapped to labels. I...

type:docs
stat:awaiting tensorflower

## Overview I'm having issues specifying the features to include/exclude when visualizing stats in TFDV. It seems like the `allowlist_features` and `denylist_features` require a `tensorflow_data_validation.types.FeaturePath` object, which took a bit...

stat:awaiting tensorflower
type:bug

Hi, I recently checked the TensorFlow Data Validation paper (https://mlsys.org/Conferences/2019/doc/2019/167.pdf). First of all, thanks for the publishing the paper and open-sourcing this project. But I cannot find similar features in...

stat:awaiting tensorflower
type:feature

Installation fails on M1 mac for tfx-bsl, tft and tfdv.

stat:awaiting tensorflower
type:build/install

For whatever reason when trying to start a dataflow job for tfdv.generate_statistics_from_csv using gc storage, doesn't work in this version for me (it fails on the fourth step every time)....

stat:awaiting tensorflower
type:bug

TensorFlow Data Validation is a great tool to look at the data. One feature that might make it even better is if it would also compute correlations among the variables,...

type:feature

Versions: tensorflow 2.6.0 py38h52b2510_1 conda-forge tensorflow-base 2.6.0 py38h1615122_1 conda-forge tensorflow-data-validation 1.4.0 pypi_0 pypi tensorflow-datasets 4.4.0 pypi_0 pypi tensorflow-estimator 2.6.0 py38h02c4698_1 conda-forge tensorflow-metadata 1.4.0 pypi_0 pypi tensorflow-serving-api 2.6.0 pypi_0 pypi ```...

stat:awaiting tensorflower
type:bug