ExpansionHunter icon indicating copy to clipboard operation
ExpansionHunter copied to clipboard

Question on using bam by chr for EH analysis

Open methionine23 opened this issue 1 year ago • 3 comments

Hello, I wonder if anyone has experiences and thoughts on running EH using a sub bam (eg bams of one chr or an assembly for many samples). Certainly need to figure out MAP 0 and off target reads/region. Will that work?

methionine23 avatar Feb 23 '24 18:02 methionine23

Do you mean that you first align reads on all chromosomes and then subsetting a chromosome/gene/etc. from that file and running EH on the subsetted BAM?

andreasssh avatar Feb 27 '24 09:02 andreasssh

yes, also similar for a situation when I have bam per chr (or contig) instead of as in one bam. Any suggestions?

methionine23 avatar Feb 27 '24 14:02 methionine23

That's fine, works well. Only caveat is that if you want to use off-target regions then indeed, can't use reads aligned on a different chromosome if it is not present in your BAM. However, for expanded alleles you can still get a repeat estimate somewhere between read and fragment length (without using offtargets), which you can then investigate further and run on a full BAM if possible...

andreasssh avatar Feb 27 '24 22:02 andreasssh