airr-standards icon indicating copy to clipboard operation
airr-standards copied to clipboard

Does having an actual date for sequencing open the potential for subject identification?

Open schristley opened this issue 1 year ago • 2 comments

My understanding is one technique to avoid subject identification is by replacing dates with relative times from an arbitrary t0 time point and that's what we have done from most of the AIRR standards. However I noticed sequencing_date is an actual date. Does it make sense to convert this to a relative time as well?

schristley avatar Jul 23 '24 16:07 schristley

sequencing_date is when you turn on the NextSeq, right? That could be arbitrarily long after sampling or even after the conclusion of the study. So I'm not sure relative time would make sense...

scharch avatar Jul 23 '24 18:07 scharch

Yeah, that's my understanding, though it should be before the conclusion of the study, if the study involves analyzing AIRR-seq then you need to sequence first... But yes, it could be arbitrary long after sampling or very soon.

I don't have a clear use case in my mind on what might happen. I was mainly wondering, if you knew the sequencing date then maybe there was the potential to backtrack and roughly determine other dates. As always you need other information if trying to do identification.

Thinking about it in another way. By having a date, you don't really know when the sequencing was done in comparison to the sampling, did the sample sit in the freezer for 20 years? Maybe that's a useful tidbit?

schristley avatar Jul 23 '24 19:07 schristley