Cassiopeia
Cassiopeia copied to clipboard
Potential Bug in intBC extract
Hi Cassiopeia Team,
I carefully checked the result from Cassiopeia, and Found a bug in some reads in SRR11357694
, for example:
AATCCAGCTAGCTGTGCAGCTCTCCGTTAGACATTTCAACTGCAGTAATGCTACCTCGTACTCACGCTTTCCAAGTGCTTGGCGTCGCATCTCGGTCCTTTGTACGCCGAAAAATGGCCTGACAACTAAGCTACGGCACGCTGCCATGTTGGGTCATAACGTGGTTCATCCGTGACCGAACATGTCATGGAGTAGCAGGAGCTATTAATTCGCGGAGGACAATGCGGTTCGTAGTCACTGTCTTCCGCAATCGTCCATCGCTCCTGCAGGTGGCCTAGAGGGCCC
with CIGAR(34M1D127M7D124M), we could manually map this to reference, it should be:
And the intBC from Cassiopeia is TCTCCGTTAGACATT
. from above, it seems that it should end with AT, rather than ATT in this context.
More information for you:
readName cellBC UMI readCount Seq CIGAR QueryBegin ReferenceBegin AlignmentScore r1 r2 r3 allele intBC cellbc_umi GGGAATGAGGGATCTG-TGCTACCGTA GGGAATGAGGGATCTG_TGCTACCGTA_000117_0+ GGGAATGAGGGATCTG TGCTACCGTA 117 AATCCAGCTAGCTGTGCAGCTCTCCGTTAGACATTTCAACTGCAGT... 34M1D127M7D124M 0 0 1309 CCGAA[None]AAATG TAACG[163:7D]TGGTT ATTCG[None]CGGAG CCGAA[None]AAATGTAACG[163:7D]TGGTTATTCG[None]C... TCTCCGTTAGACATT
could you figure out why ? Thanks