presidio icon indicating copy to clipboard operation
presidio copied to clipboard

Adding QR codes support in the ImageRedactorEngine

Open vpvpvpvp opened this issue 2 years ago • 1 comments

Is your feature request related to a problem? Please describe. QR codes usually contain PII entities. But ocr engines like tesseract can't recognize QR codes on images.

Describe the solution you'd like Integrate into Presidio Image Redactor the ability to recognize QR codes and analyze their contents. And if the QR code contains PII, then redact it. One way to implement this functionality is to add a new QRImageAnalyzerEngine in addition to the default ImageAnalyzerEngine. And then passing QRImageAnalyzerEngine as the image_analyzer_engine parameter to ImageRedactorEngine and to ImagePiiVerifyEngine.

Additional context An example of the input image (first image) and the desired output (second image):

qr_multi-min

qr_multi_redacted-min

vpvpvpvp avatar Feb 17 '23 18:02 vpvpvpvp

Thanks! we'll review the PR shortly.

omri374 avatar Feb 19 '23 09:02 omri374