How are subgroups to be dealt with in derivatives? E.g. patients versus controls

Open chrisgorgo opened this issue 6 years ago • 8 comments

How are subgroups to be dealt with? E.g. patients versus controls. Perhaps like this:

group-patients
group-controls

where all subjects would be

group-all

_Originally posted by @robertoostenveld in https://github.com/render_node/MDExOlB1bGxSZXF1ZXN0MjM4OTg3Njgy/timeline/more_items

chrisgorgo avatar Mar 05 '19 16:03 chrisgorgo

Back in September I added something about this in the Google Docs, but it seems to have disappeared: https://github.com/bids-standard/bids-specification/pull/109#discussion_r261477328

EDIT ----

I proposed grp-label instead of group-label as an analogy to sub-label, e.g.:

grp-infants/
    anat/
        grp-infants_desc-average_T1w.nii.gz

could be a group-specific T1w template.
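
To make the proposal concrete, here is a minimal sketch of how a tool might assemble such a path. Note that `grp-<label>` is only the entity proposed in this thread, not part of the BIDS specification, and the helper `group_derivative_path` and its parameters are hypothetical names made up for illustration.

```python
from pathlib import PurePosixPath


def group_derivative_path(group_label, datatype, suffix, desc=None, ext=".nii.gz"):
    """Build a relative path for a group-level file.

    Assumes the `grp-<label>` entity proposed in this thread (not an
    official BIDS entity) and mirrors the sub-<label> layout.
    """
    # Labels are restricted to ASCII letters and digits, as for sub-<label>.
    if not (group_label.isascii() and group_label.isalnum()):
        raise ValueError("group label must be alphanumeric")
    entities = [f"grp-{group_label}"]
    if desc is not None:
        entities.append(f"desc-{desc}")
    filename = "_".join(entities + [suffix]) + ext
    return PurePosixPath(f"grp-{group_label}") / datatype / filename


print(group_derivative_path("infants", "anat", "T1w", desc="average"))
# grp-infants/anat/grp-infants_desc-average_T1w.nii.gz
```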

oesteban avatar Mar 08 '19 02:03 oesteban

Indeed, handling of group results was removed from https://github.com/bids-standard/bids-specification/pull/109 to make the scope more manageable. Sorry this wasn't clear. This issue is to track this feature, which hopefully can be covered by a new PR after #109 is merged.

chrisgorgo avatar Mar 08 '19 17:03 chrisgorgo

Yep, I think it disappeared before you even started the daunting task of #109. Anyway, that is irrelevant now; let's see whether we can build a neat implementation of groups :)

oesteban avatar Mar 08 '19 19:03 oesteban

What about putting this information in a project-wide metadata file? That would help make the analysis blind, since researchers could keep the file unopened (or unwritten) up until the conclusion of the analysis. If groups are separated by directories or file names, it's going to be hard to even perform basic preprocessing QC in an unbiased fashion.

TheChymera avatar Sep 10 '19 12:09 TheChymera

Has this been addressed via the example participants.tsv file, where there is a group column?

rob-luke avatar Jun 20 '21 05:06 rob-luke

I'd go for a simple solution: a recommended column in participants.tsv and possibly part of the name, e.g. sub-control001, sub-patient001, etc. I used that before and it worked great.

Having a recommended naming would, however, help pipelines (control is not the same as healthy; for instance, you can have a patient group of interest vs. a control patient group).

CPernet avatar Jun 20 '21 07:06 CPernet

> What about putting this information in a project-wide metadata file?

This is readily available with BIDS "raw".

> That would help make analysis blind, since researchers could keep the file unopened (or unwritten) up until the conclusion of the analysis. If groups are separated by directories or file names, it's going to be hard to even perform basic preprocessing QC in an unbiased fashion.

This is not a use case of this PR. Although it is good practice, BIDS (incl. Derivatives) already permits this to be done easily. It's the researcher's choice to encode things without obvious identifiers.

This PR is for encoding results that are group-wise in nature, such as the custom template above.

> a recommended column in participants.tsv

This is what @TheChymera recommended above, and perfectly doable ATM.
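
As a rough sketch of what that looks like in practice, the snippet below reads a participants.tsv and collects subjects per group using only the standard library. The column name `group` follows the discussion above; the file contents and the helper `subjects_by_group` are invented for illustration. Subject labels stay neutral, so group membership lives only in the table.

```python
import csv
import io
from collections import defaultdict


def subjects_by_group(tsv_file, column="group"):
    """Map each value of `column` to the participant_id values that carry it.

    Assumes a tab-separated participants.tsv with a `participant_id`
    column; `group` is the column name used in this thread.
    """
    reader = csv.DictReader(tsv_file, delimiter="\t")
    groups = defaultdict(list)
    for row in reader:
        # Missing or empty values are filed under "n/a".
        groups[row.get(column) or "n/a"].append(row["participant_id"])
    return dict(groups)


# Made-up example content standing in for a real participants.tsv.
example = io.StringIO(
    "participant_id\tage\tgroup\n"
    "sub-001\t34\tcontrol\n"
    "sub-002\t41\tpatient\n"
    "sub-003\t29\tcontrol\n"
)
print(subjects_by_group(example))
# {'control': ['sub-001', 'sub-003'], 'patient': ['sub-002']}
```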

> possibly part of the name sub-control001, sub-patient001, etc. ... I used that before, worked great

Encoding variables within entity values is discouraged and definitely a bad idea. The best example is QC, as Chris was mentioning.

That said, this suggestion does not solve the problem that analysis generates some interim results that derive from two or more different participants.

oesteban avatar Jun 20 '21 07:06 oesteban

Apologies for reviving an old post, but I was wondering whether there was a follow-up on this discussion with regard to templates and other grouped intermediate results. I cannot seem to find a clear consensus on how to proceed in this case.

omar-rifai avatar Mar 14 '22 14:03 omar-rifai