bids-specification
                                
                                 bids-specification copied to clipboard
                                
                                    bids-specification copied to clipboard
                            
                            
                            
                        How are subgroups to be dealt with in derivatives? E.g. patients versus controls
How are subgroups to be dealt with? E.g. patients versus controls. Perhaps like this:
group-patients
group-controls
where all subjections would be
group-all
_Originally posted by @robertoostenveld in https://github.com/render_node/MDExOlB1bGxSZXF1ZXN0MjM4OTg3Njgy/timeline/more_items
Back in September I added some about this in the Google Docs, but it seems it disappeared - https://github.com/bids-standard/bids-specification/pull/109#discussion_r261477328
EDIT ----
I proposed grp-label instead of group-label as an analogy to sub-label, e.g.:
grp-infants/
    anat/
        grp-infants_desc-average_T1w.nii.gz
could be a group-specific T1w template.
Indeed handling group results have been removed from https://github.com/bids-standard/bids-specification/pull/109 to make the scope more manageable. Sorry this wasn't clear. This issue is to track this feature which hopefully can be covered by a new PR after #109 is merged.
Yep, I think it disappeared before you even started the daunting task of #109. Anyways, that is irrelevant now, let's see whether we can build a neat implementation of groups :)
What about putting this information in a project-wide metadata file? That would help make analysis blind, since researchers could keep the file unopened (or unwriten) up until the conclusion of the analysis. If groups are separated by directories or file names, it's going to be hard to even perform basic preprocessing QC in an unbiased fashion.
Has this been addressed via the example participant.tsv file where there is a group column?
I'd go for a simple solution - a recommended column in participants.tsv and possibly part of the name sub-control001, sub-patient001, ect ... I used that before, worked great
Having recommended naming would however help for pipelines (control is not the same as healthy, for instance you can have a patient group of interest vs a control patient group)
What about putting this information in a project-wide metadata file?
This is readily available with BIDS "raw".
That would help make analysis blind, since researchers could keep the file unopened (or unwriten) up until the conclusion of the analysis. If groups are separated by directories or file names, it's going to be hard to even perform basic preprocessing QC in an unbiased fashion.
This is not a use case of this PR - albeit this is a good practice, BIDS (incl. Derivatives) already permits this be done easily. It's a choice of the researcher to encode things without obvious identifiers.
This PR is for encoding results that are group-wise in nature. Such as the custom template above.
a recommended column in participants.tsv
This is what @TheChymera recommended above, and perfectly doable ATM.
possibly part of the name sub-control001, sub-patient001, ect ... I used that before, worked great
Encoding variables within entity values is discouraged and definitely a bad idea. Best example is QC as Chris was mentioning.
That said, this suggestion does not solve the problem that analysis generates some interim results that derive from two or more different participants.
Apologies for reviving an old post but I was wondering whether there was a followup on this discussion with regards to templates and other grouped intermediate results. I can not seem to find a clear consensus on how to proceed in this case.