anonymizer icon indicating copy to clipboard operation
anonymizer copied to clipboard

Library for identification, anonymization and de-anonymization of PII data

Results 29 anonymizer issues
Sort by recently updated
recently updated
newest added

As part of this story, we would build an anonymizer action that deals with dropping of the PII elements in the column values.

Enhancement
Story size : 1

Add a section in the schema that can hold the information on whether a given column has to be taken into account for PII detection or not.

Enhancement
Story size : 2

Build a pool for identifying personal contact information - [x] Phone number - [x] Email

Enhancement
Story size : 2

Capability to identify following PII data related to Singaporeans - [x] NRIC / FIN - [ ] Work Permit Id - [ ] Passport

Enhancement
Story size : 3

parse a delimited file output as columns / pandas dataframe

Enhancement
Story size : 1

Design a contract for supporting different inputs and formats Decide on a viable output format for the next step (ex pandas df)

Story size : 1
tech task

Take parser output, Split it if required in an efficient way Run it against all the available regex matchers Record the result whether a particular cell is PII or not

Enhancement
Story size : 3

Given the findings from the regex matchers, generate a report that is user friendly that displays : 1. Columns with PII data 2. Low level granularity - showing which cells...

Enhancement
Story size : 2

Compare support for : 1. basic PII like name, email, phone number, NRIC, etc 2. custom PII identification 3. localization (esp. Asia) - support for PDPA policies 4. free-text PII...

Spike