GWA_tutorial
GWA_tutorial copied to clipboard
why individual 13291 NA07045 in 0.2_low_call_rate_pihat.txt
May I ask "For each pair of 'related' individuals with a pihat > 0.2, we recommend to remove the individual with the lowest call rate"? What does call rate mean in this sentence, and why do I need to "plink --bfile HapMap_3_r3_11 --missing" after getting the file pihat_min0.2_in_founders.genome? Of course the crux of the matter is how to get the result by the above operation, individual 13291 NA07045 has the lowest call rate, i.e. how to get the data in 0.2_low_call_rate_pihat.txt?
You should have a look into the pihat_min0.2.genome file to observe conflictions. Once you know the samples thatare highly related, you want to remove the one, which has the higher proportion of missing genotypes. How? Based on my understanding, samples with a lower proportion of missing genotypes can be considered to have a better call rate. Thus, you need to use plink --bfile HapMap_3_r3_11 --missing again. NA07045 has a higher proportion of missing genotypes and should therefore be removed.