OpenAdapt icon indicating copy to clipboard operation
OpenAdapt copied to clipboard

Pseudonymization (aka. Maintain Identity semantics during Scrub)

Open atomicrichard opened this issue 2 years ago • 5 comments

Feature request

When calling scrub.scrub_text, instead of just replacing with ***, we would like to retain semantic meaning. e.g. given:

Alice said hi to Bob

We want:

<PersonA> said hi to <PersonB>

and not just:

*** said hi to ***

@KrishPatel13 🙏

Motivation

I'm always frustrated when [...] so this feature would [...].

atomicrichard avatar Jun 14 '23 23:06 atomicrichard

Nice Article which describes the Pseudonymization (related Library: OpenRedact)

Link: https://medium.com/@openredact/anonymizer-a-framework-for-text-anonymization-499855f639d4 image

KrishPatel13 avatar Jun 29 '23 05:06 KrishPatel13

Must Read: https://github.com/MLDSAI/OpenAdapt/issues/330#issuecomment-1612437801

KrishPatel13 avatar Jun 29 '23 05:06 KrishPatel13

Tested OpenRedact/anonymizer (but the there is no API refrence, no docs) and also had issues on testing it manuallly.

Then proceeded to find another psuedonymization library, and found about Cape-Privacy

Good Choice Cape-Privacy for OpenAdapt: https://docs.capeprivacy.com/getting-started/

KrishPatel13 avatar Jul 21 '23 17:07 KrishPatel13

Shall we close this @abrichr as completed ?

KrishPatel13 avatar Jul 14 '24 15:07 KrishPatel13

@KrishPatel13 can you please link to where this has been implemented?

abrichr avatar Jul 15 '24 15:07 abrichr