xqtl-protocol
Sample duplication issue in ROSMAP RNA-seq and genotype data
This issue serves as a reminder of the questions to be answered; @Rhopala will write up a comprehensive description of the scenario later on. @gaow, please correct me if I got the questions wrong.
According to the ROSMAP metadata CSV on Synapse:
- RNA-seq
  - 1.1. For some individual IDs (e.g. "R2809589"), there are two RNA-seq records for the same tissue.
    - 1.1.1. Why is that?
    - 1.1.2. How does it impact our analysis?
    - 1.1.3. What should we do with it?
- Genotype
  - 2.1. For some individual IDs (e.g. "R3257830"), there are two WGS records for different tissues, or even for the same tissue (e.g. "R1631616").
    - 2.1.1. Why is that?
    - 2.1.2. How does it impact our analysis?
    - 2.1.3. The way to deal with it is to remove duplicates while maximizing overlap.
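As a starting point for discussion, here is a minimal sketch of how the duplicated records could be flagged and then removed while keeping the RNA-seq/WGS overlap as large as possible. It is only one possible reading of "maximizing overlap": when an individual has several records, it keeps the one whose specimen is in a preferred set (for example, specimens known to be present in the data files). The column names `individualID`, `specimenID` and `tissue`, the helper names, and all values in the toy records are assumptions for illustration, not taken from the actual metadata.

```python
from collections import defaultdict


def find_duplicates(records, keys=("individualID", "tissue")):
    """Group records by `keys` and return only the groups with more than one record."""
    groups = defaultdict(list)
    for rec in records:
        groups[tuple(rec[k] for k in keys)].append(rec)
    return {k: v for k, v in groups.items() if len(v) > 1}


def deduplicate(records, preferred_specimens=frozenset()):
    """Keep one record per individualID.

    A record whose specimenID is in `preferred_specimens` replaces a kept
    record that is not; otherwise the first record seen is kept.
    """
    kept = {}
    for rec in records:
        ind = rec["individualID"]
        current = kept.get(ind)
        if current is None:
            kept[ind] = rec
        elif (rec["specimenID"] in preferred_specimens
              and current["specimenID"] not in preferred_specimens):
            kept[ind] = rec
    return list(kept.values())


# Toy records (hypothetical IDs and tissues, not real ROSMAP entries).
rnaseq = [
    {"individualID": "IND_A", "specimenID": "rna_1", "tissue": "tissue_x"},
    {"individualID": "IND_A", "specimenID": "rna_2", "tissue": "tissue_x"},
    {"individualID": "IND_B", "specimenID": "rna_3", "tissue": "tissue_x"},
]
wgs = [
    {"individualID": "IND_A", "specimenID": "wgs_1", "tissue": "tissue_y"},
    {"individualID": "IND_B", "specimenID": "wgs_2", "tissue": "tissue_y"},
    {"individualID": "IND_B", "specimenID": "wgs_3", "tissue": "tissue_z"},
    {"individualID": "IND_C", "specimenID": "wgs_4", "tissue": "tissue_y"},
]

# Case 1.1: same individual, same tissue, two RNA-seq records.
rna_dups = find_duplicates(rnaseq)
# Case 2.1: same individual with more than one WGS record, any tissue.
wgs_dups = find_duplicates(wgs, keys=("individualID",))

# Assume wgs_3 is the specimen we would rather keep for IND_B.
rna_unique = deduplicate(rnaseq)
wgs_unique = deduplicate(wgs, preferred_specimens={"wgs_3"})

# Individuals with both RNA-seq and WGS after deduplication.
overlap = ({r["individualID"] for r in rna_unique}
           & {r["individualID"] for r in wgs_unique})
```

Because deduplication here is done per individual, the set of overlapping individuals does not shrink; the open question is which of the duplicated specimens should be the one retained, which depends on the answers to 1.1.1 and 2.1.1.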