mirp icon indicating copy to clipboard operation
mirp copied to clipboard

Consider adding DICOM attributes to feature extraction output

Open kirbyju opened this issue 9 months ago • 2 comments

Please use the below template to submit your issue:

Expected behaviour

In the output of extract_features() when working with DICOM data I think it would be helpful to add image_series_description, image_series_instance_uid, mask_series_description, and mask_series_instance_uid columns. It may also be worth including image_study_description and image_study_date. This would make it easier to see which images/masks were being analyzed in datasets that contain multiple RTSTRUCT tumor segmentations per subject (e.g. the annotated TCIA CPTAC collections. In many cases the same tumor was annotated in multiple scans which resulted the same sample_name and image_mask name.

Observed behaviour

Only sample_name, image_modality and image_mask_name were provided as a way to identify the extraction results.

Steps to reproduce the problem

You can find sample data to reproduce this at https://doi.org/10.7937/SKQ4-QX48.

Specifications

  • MIRP version: 2.2.1
  • Python version: 3.8.11
  • Operating system: MacOS Sonoma 14.4.1
  • Other package versions (if relevant):

kirbyju avatar May 14 '24 12:05 kirbyju