gatk icon indicating copy to clipboard operation
gatk copied to clipboard

Variant lost when interval expanded.

Open wangshun1121 opened this issue 3 years ago • 0 comments

GATK version: gatk-4.2.3

I have two bam files: Tumor.bam is cfDNA data, and Normal.bam were reads from white blood cells.

There's a variant C to G at chr7:116795782 (hg38), I can get this variant using following command:

gatk --java-options -Xmx4000m Mutect2  \
  -R /Homo_sapiens_assembly38.fasta \
  -I Tumor.bam \
  -I Normal.bam \
  -normal GA03009QX \
  -L chr7:116795653-116795791 \
  --interval-padding 100 \
  -O output.vcf

and I can get the variant I want in output.vcf

##fileformat=VCFv4.2
##FORMAT=<ID=AD,Number=R,Type=Integer,Description="Allelic depths for the ref and alt alleles in the order listed">
##FORMAT=<ID=AF,Number=A,Type=Float,Description="Allele fractions of alternate alleles in the tumor">
...
##filtering_status=Warning: unfiltered Mutect 2 calls.  Please run FilterMutectCalls to remove false positives.
##normal_sample=GA03009QX
##source=Mutect2
##tumor_sample=GA03009CF
#CHROM	POS	ID	REF	ALT	QUAL	FILTER	INFO	FORMAT	GA03009CF	GA03009QX
chr7	116795782	.	C	G	.	.	AS_SB_TABLE=1157,1223|8,9;DP=2519;ECNT=1;MBQ=20,20;MFRL=169,173;MMQ=60,60;MPOS=30;NALOD=1.23;NLOD=4.82;POPAF=6.00;TLOD=20.23	GT:AD:AF:DP:F1R2:F2R1:FAD:SB	0/1:2357,17:8.145e-03:2374:739,8:746,4:1570,12:1145,1212,8,90/0:23,0:0.056:23:5,0:11,0:16,0:12,11,0,0

Then, I add 4 exon regions next to the exon where my variant is in interval_list.bed:

chr7    116781986	116782097
chr7    116783302   116783469
chr7	116795653	116795791
chr7	116795885	116796124
chr7	129189150	129189482

and I use similar GATK command line:

gatk --java-options -Xmx4000m Mutect2  \
  -R /Homo_sapiens_assembly38.fasta \
  -I Tumor.bam \
  -I Normal.bam \
  -normal GA03009QX \
  -L interval_list.bed \
  --interval-padding 100 \
  -O output.vcf

and the variant I want at chr7:116795782 LOST:

##filtering_status=Warning: unfiltered Mutect 2 calls.  Please run FilterMutectCalls to remove false positives.
##normal_sample=GA03009QX
##source=Mutect2
##tumor_sample=GA03009CF
#CHROM	POS	ID	REF	ALT	QUAL	FILTER	INFO	FORMAT	GA03009CF	GA03009QX
chr7	116781913	.	T	C	.	.	AS_SB_TABLE=0,0|368,162;DP=540;ECNT=1;MBQ=0,20;MFRL=0,179;MMQ=60,60;MPOS=21;NALOD=-1.163e+01;NLOD=-1.133e+01;POPAF=6.00;TLOD=1695.59	GT:AD:AF:DP:F1R2:F2R1:FAD:SB	0/1:0,527:0.997:527:0,190:0,194:0,384:0,0,365,162	0/0:0,3:0.800:3:0,2:0,1:0,3:0,0,3,0

Two bam files are in my following bams.zip: bams.zip

wangshun1121 avatar Sep 16 '22 08:09 wangshun1121