datafaker
datafaker copied to clipboard
Allow an easy way to create random PII/PHI data
In a project I'm working on, I'm working with AWS Comprehend to detect and redact PII data.
It would be nice if Datafaker could generate some explicit data including PII data, such as birth dates, full names, credit card data, medical records, etc.
Perhaps something like:
faker.pii().medical()
faker.pii().banking()
faker.pii().general()
or
faker.medical().phi()
faker.banking().pii()
Not sure yet, but the output should be something like:
Hello Zhang Wei, I am John. Your AnyCompany Financial Services, LLC credit card account 1111-0000-1111-0008 has a minimum payment of $24.53 that is due by July 31st. Based on your autopay settings, we will withdraw your payment on the due date from your bank account number XXXXXX1111 with the routing number XXXXX0000.
Customer feedback for Sunshine Spa, 123 Main St, Anywhere. Send comments to Alice at [email protected]. I enjoyed visiting the spa. It was very comfortable but it was also very expensive. The amenities were ok but the service made the spa a great experience.
(this is the AWS sample message).
I'll most likely build the above, but suggestions / feedback welcome.