minimap2 icon indicating copy to clipboard operation
minimap2 copied to clipboard

Small alignment block in between large gaps observed in CIGAR

Open SHuang-Broad opened this issue 5 years ago • 2 comments

Hello Heng,

After you fixed #392 via the latest release (thanks!), I've noticed another edge case that is—I gues—related issue/feature.

I am observing some alignments that are showing CIGAR with small alignment block in between large gaps, e.g.

340D9M340I
341I1M339I
339D1M680D
341I1M340D

Now a side effect of this is that SV callers might be fooled by these CIGARs and emit more than one calls (yes, I've actually observed this behavior).

What do you think should be the correct behavior here? Do you think breaking it into split alignments is better?

I've attached a BAM with such alignment records, where a custom attribute XN:Z is added to emphasize the CIGAR.

Thanks! strange_cigars.zip

SHuang-Broad avatar Jul 02 '19 21:07 SHuang-Broad

Completely fixing this will be very hard. I will have a look at some point. Thanks very much on the example.

lh3 avatar Jul 08 '19 13:07 lh3

I think this is somewhat tied to the characteristics of PacBio read errors (haven't yet looked into ONT reads yet); so yes, I can imagine this being very hard. Thanks!

SHuang-Broad avatar Jul 08 '19 13:07 SHuang-Broad