Remove Metadata such as scan date, image ID

Open jetic83 opened this issue 7 years ago • 3 comments

Is there a way to delete certain fields in the scans, such as comments, scan date, or filename? This meta-information can be considered PHI, or at least non-anonymized data.

jetic83, Jul 02 '18 20:07

Has anyone followed up on this issue? It would make anonymize-slide much more useful. As of now, the PHI remaining in the metadata is proving to be a problem.

a-dev-walker, Mar 05 '20 21:03

It can be done. Examples from a PR:

fh.directories[1].entries[XMLPACKET].overwrite_entry(our_xmp)
fh.directories[1].entries[IMAGE_DESCRIPTION].overwrite_entry(our_image_desc)

These overwrite the XMLPACKET and IMAGE_DESCRIPTION tags on directory #1. To delete a field, overwrite it with safe values. Make sure the replacement is the same length as the original; otherwise, some of the original data might still be there.
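
To illustrate the same-length point, here is a minimal sketch in plain Python. It is not taken from the PR and does not use its API: `pad_to_length` and `overwrite_in_place` are hypothetical helper names, and the byte string standing in for a slide file is made up for the demo. The only idea it shows is padding the safe value so that every byte of the original field gets overwritten.

```python
import io


def pad_to_length(safe_text: bytes, length: int, fill: bytes = b" ") -> bytes:
    """Return safe_text truncated or padded to exactly `length` bytes.

    Hypothetical helper for illustration; `fill` must be a single byte.
    """
    if len(fill) != 1:
        raise ValueError("fill must be exactly one byte")
    return safe_text[:length].ljust(length, fill)


def overwrite_in_place(fh, offset: int, length: int, safe_text: bytes = b"") -> None:
    """Overwrite `length` bytes starting at `offset` in a seekable binary file object."""
    fh.seek(offset)
    fh.write(pad_to_length(safe_text, length))


# Made-up stand-in for a file that contains a sensitive field.
data = b"HEADER|ScanDate=2018-07-02|TRAILER"
sensitive = b"ScanDate=2018-07-02"

buf = io.BytesIO(data)
offset = data.find(sensitive)

# Replace the field with a padded safe value of identical length.
overwrite_in_place(buf, offset, len(sensitive), b"ANONYMIZED")
result = buf.getvalue()

# For contrast: writing a shorter value without padding leaves the tail behind.
leaky = io.BytesIO(data)
leaky.seek(offset)
leaky.write(b"ANONYMIZED")
leaky_result = leaky.getvalue()
```

In the unpadded case the trailing bytes of the original date are still present after the write, which is the situation the same-length advice is meant to avoid. In a real slide file the offset and length would have to come from the tag entry itself rather than from a byte search.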

markemus, Jun 23 '20 22:06

@jetic83, @a-dev-walker: Within the EMPAIA project, we have now developed our own solution for anonymizing WSIs (in various formats), including all the sensitive metadata. It is currently available via GitLab. The paper is in review; the preprint can already be viewed on arXiv.

Tomatenbiss, Nov 14 '22 08:11