
Does REViewer plot reads from off-target regions?

Open bw2 opened this issue 4 years ago • 2 comments

I ran EHv4 on the FXN locus, using 2 repeat specs which are identical except that the 2nd one includes GAA off-target regions:

    {
        "LocusId": "FXN-chr9-69037286-69037304-GAA",
        "LocusStructure": "(GAA)*",
        "RepeatUnit": "GAA",
        "ReferenceRegion": "chr9:69037286-69037304",
        "VariantType": "RareRepeat",
        "OfftargetRegions": []
    },
    {
        "LocusId": "FXN-chr9-69037286-69037304-GAA-with-off-targets",
        "LocusStructure": "(GAA)*",
        "RepeatUnit": "GAA",
        "ReferenceRegion": "chr9:69037286-69037304",
        "VariantType": "RareRepeat",
        "OfftargetRegions": [
            "chr2:220546033-220546610",
            "chr5:127247161-127247640",
            "chrX:51621350-51621856",
            "chr1:101657701-101658187",
            "chr13:102161416-102161881",
            "chr7:37848005-37848522",
            "chrY:25645531-25646013",
            "chr7:84690949-84691442",
            "chrUn_KN707747v1_decoy:1062-2074",
            "chr6:50708070-50708556",
            "chrY:24024122-24024600"
        ]
    },

These are the EHv4 results for the (relatively rare) WGS sample where the genotypes from the 2 specs differed significantly. With no off-targets:

chr9	69037286	.	A	<STR9>,<STR110>	.	PASS	END=69037304;REF=6;RL=18;RU=GAA;VARID=FXN-chr9-69037286-69037304-GAA;REPID=FXN-chr9-69037286-69037304-GAA	GT:SO:REPCN:REPCI:ADSP:ADFL:ADIR:LC	1/2:SPANNING/INREPEAT:9/110:9-10/63-153:2/0:5/13:0/12:42.016851

and with off-targets:

chr9	69037286	.	A	<STR33>,<STR726>	.	PASS	END=69037304;REF=6;RL=18;RU=GAA;VARID=FXN-chr9-69037286-69037304-GAA-with-0.01-threshold-off-targets;REPID=FXN-chr9-69037286-69037304-GAA-with-0.01-threshold-off-targets	GT:SO:REPCN:REPCI:ADSP:ADFL:ADIR:LC	1/2:FLANKING/INREPEAT:33/726:33-111/658-1313:0/0:13/13:4/108:42.016851
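To make the difference between the two records easier to see, here is a minimal sketch that pulls the per-sample fields out of each line. It assumes a single-sample, tab-separated VCF record; `parse_sample_fields` is a hypothetical helper, not part of EH or REViewer, and the INFO column is shortened for readability.

```python
def parse_sample_fields(vcf_line):
    """Map FORMAT keys to the sample's values for one single-sample VCF record."""
    columns = vcf_line.rstrip("\n").split("\t")
    format_keys = columns[8].split(":")    # 9th column: FORMAT
    sample_values = columns[9].split(":")  # 10th column: the only sample
    return dict(zip(format_keys, sample_values))


no_off_targets = "\t".join([
    "chr9", "69037286", ".", "A", "<STR9>,<STR110>", ".", "PASS",
    "END=69037304;REF=6;RL=18;RU=GAA",  # INFO shortened: VARID/REPID omitted
    "GT:SO:REPCN:REPCI:ADSP:ADFL:ADIR:LC",
    "1/2:SPANNING/INREPEAT:9/110:9-10/63-153:2/0:5/13:0/12:42.016851",
])
with_off_targets = "\t".join([
    "chr9", "69037286", ".", "A", "<STR33>,<STR726>", ".", "PASS",
    "END=69037304;REF=6;RL=18;RU=GAA",  # INFO shortened: VARID/REPID omitted
    "GT:SO:REPCN:REPCI:ADSP:ADFL:ADIR:LC",
    "1/2:FLANKING/INREPEAT:33/726:33-111/658-1313:0/0:13/13:4/108:42.016851",
])

for label, line in [("no-off-targets", no_off_targets),
                    ("with-off-targets", with_off_targets)]:
    fields = parse_sample_fields(line)
    # REPCN: repeat counts per allele; ADIR: in-repeat read counts per allele
    print(label, fields["REPCN"], fields["REPCI"], fields["ADIR"])
```

The in-repeat read count for the long allele (ADIR) goes from 12 to 108 once the off-target regions are included, which is what drives the much larger size estimate.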

I then ran REViewer on both outputs and got these plots:

no-off-targets: CDS-nC6iXU_FXN-chr9-69037286-69037304-GAA_ExpansionHunter4

with-off-targets: CDS-nC6iXU_FXN-chr9-69037286-69037304-GAA-with-0 01-threshold-off-targets_ExpansionHunter4

Finally, this is the plot I got when I used the standard FXN repeat spec included in the EHv4 repo: CDS-nC6iXU_FXN-chr9-69037286-69037304-GAA-official_ExpansionHunter4

I'm wondering how to interpret the "with-off-targets" plot. In the REViewer docs, I saw: "...the current version of REViewer visualizes repeats whose span does not exceed the fragment length (longer repeats are capped at the fragment length)". Does REViewer not plot the off-target FRRs?

Thanks -Ben

bw2, Jan 11 '21 02:01

That's right. The current version of EH does not generate graph alignments of in-repeat read pairs (both mates inside the repeat). To account for the absence of these reads, REViewer caps the repeat size estimate at the fragment length. However, your example (with-off-targets) shows that sometimes the size of the repeat needs to be reduced further.
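As a rough illustration of the capping described above (a sketch only, not REViewer's actual code): the function name, the floor division to whole repeat units, and the 450 bp fragment length are all assumptions made for this example.

```python
def cap_repeat_units(repeat_units, unit_length_bp, fragment_length_bp):
    """Cap a repeat size (in units) so its span does not exceed the fragment length."""
    max_units = fragment_length_bp // unit_length_bp  # whole units that fit in one fragment
    return min(repeat_units, max_units)


FRAGMENT_LENGTH_BP = 450  # hypothetical value; the real one depends on the library
UNIT_LENGTH_BP = len("GAA")

# Long-allele estimates from the two runs in this issue.
for label, units in [("no-off-targets", 110), ("with-off-targets", 726)]:
    capped = cap_repeat_units(units, UNIT_LENGTH_BP, FRAGMENT_LENGTH_BP)
    print(label, units, "->", capped)
```

Under these assumed numbers, the 110-unit estimate (330 bp) is below the cap and stays as is, while the 726-unit estimate (2178 bp) is cut to 150 units, so the two plots would still show different allele sizes.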

For now, we could cap the repeat size in a better way that would make both plots (with-off-targets and no-off-targets) look the same. Of course, a better solution would be to enable EH to output alignments of in-repeat read pairs.

egor-dolzhenko, Jan 12 '21 07:01

I see, that makes sense. Thanks.

bw2, Jan 12 '21 16:01