data-tools
data-tools copied to clipboard
Add `dataprofiler`
What is this tool for?
The DataProfiler is a Python library designed to make data analysis, monitoring, and sensitive data detection easy.
Loading Data with a single command, the library automatically formats & loads files into a DataFrame. Profiling the Data, the library identifies the schema, statistics, entities (PII / NPI) and more. Data Profiles can then be used in downstream applications or reports.
Resources
- Docs: capitalone.github.io/DataProfiler
- Repo: https://github.com/capitalone/DataProfiler
Bump
@victorcouste, would love to see if we can get this merged in to your repo! Glad to answer any questions about Data Profiler. Thanks!