anonymizer
anonymizer copied to clipboard
Library for identification, anonymization and de-anonymization of PII data
Bumps [wheel](https://github.com/pypa/wheel) from 0.37.0 to 0.38.1. Changelog Sourced from wheel's changelog. Release Notes UNRELEASED Updated vendored packaging to 22.0 0.38.4 (2022-11-09) Fixed PKG-INFO conversion in bdist_wheel mangling UTF-8 header values...
Allows specifying output file format to be csv or parquet
When a glob is specified as an input, it should ingest all files that matches into the anonymizer.
Bumps [pyspark](https://github.com/apache/spark) from 3.1.2 to 3.2.2. Commits 78a5825 Preparing Spark release v3.2.2-rc1 ba978b3 [SPARK-39099][BUILD] Add dependencies to Dockerfile for building Spark releases 001d8b0 [SPARK-37554][BUILD] Add PyArrow, pandas and plotly to...
Bumps [numpy](https://github.com/numpy/numpy) from 1.21.2 to 1.22.0. Release notes Sourced from numpy's releases. v1.22.0 NumPy 1.22.0 Release Notes NumPy 1.22.0 is a big release featuring the work of 153 contributors spread...
Bumps [ipython](https://github.com/ipython/ipython) from 7.27.0 to 7.31.1. Commits e321e76 release 7.31.1 67ca2b3 Merge pull request from GHSA-pq7m-3gw7-gq5x 2794330 back to dev be343e7 release 7.31.0 0fcf2c4 Merge pull request #13428 from meeseeksmachine/auto-backport-of-pr-13427-on-7.x...
- [x] acquire - [x] analyze - [x] anonymize ~~report~~ - [x] write Acceptance criteria: output from spark is the same as from pandas previously
If the directory in the output path specified in the config file does not exist then there was an error. This pr checks if the output directory exists and creates...
As far as I can see, if the directory in the output path specified in the config file does not exist then there is an error as like: ``` FileNotFoundError:...