OpenAdapt
OpenAdapt copied to clipboard
Pseudonymization (aka. Maintain Identity semantics during Scrub)
Feature request
When calling scrub.scrub_text, instead of just replacing with ***, we would like to retain semantic meaning. e.g. given:
Alice said hi to Bob
We want:
<PersonA> said hi to <PersonB>
and not just:
*** said hi to ***
@KrishPatel13 🙏
Motivation
I'm always frustrated when [...] so this feature would [...].
Nice Article which describes the Pseudonymization (related Library: OpenRedact)
Link: https://medium.com/@openredact/anonymizer-a-framework-for-text-anonymization-499855f639d4
Must Read: https://github.com/MLDSAI/OpenAdapt/issues/330#issuecomment-1612437801
Tested OpenRedact/anonymizer (but the there is no API refrence, no docs) and also had issues on testing it manuallly.
Then proceeded to find another psuedonymization library, and found about Cape-Privacy
Good Choice Cape-Privacy for OpenAdapt: https://docs.capeprivacy.com/getting-started/
Shall we close this @abrichr as completed ?
@KrishPatel13 can you please link to where this has been implemented?