bamdst icon indicating copy to clipboard operation
bamdst copied to clipboard

Coverage difference between bamdst and gatk DepthOfCoverage

Open jhylwq123 opened this issue 1 year ago • 1 comments

Does bamdst remove duplicates reads automatically? I first used the "time picard MarkDuplicates REMOVE_DUPLICATES=true" function and calculate the coverage using "gatk DepthOfCoverage" based on removed duplicated bam file. The I used bamdst to calculate the coverage based on unremoved duplicated bam file, because I thought bamdst has the function of remove duplicates. However, bamdst calculated a much more higher coverage(100X) than gatk. How could this happen?

jhylwq123 avatar Jun 03 '23 07:06 jhylwq123

No, bamdst only read the alignment information from the BAM. The PCR duplicate reads in the report are labeled in the SAM flag. Therefore you should use GATK/samtools/sambamba to mark duplicates before calculating the coverage.

shiquan avatar Jun 03 '23 18:06 shiquan