cuteSV icon indicating copy to clipboard operation
cuteSV copied to clipboard

Q: Detection SV by aligning to diploid (not haplotype resolved) genome

Open Alteroldis opened this issue 3 months ago • 2 comments

Hi Dr Jiang.

I work with species, genome of which has a high number of genome rearrangements. Because of that I can assembly only diploid version of genome with Flye. And I think, I can resolve these by breaking my reads at points of structural variations and assembly them again. Will you approach work if I align reads to diploid genome, not haploid? And may be this resolve contigs of genome to different alleles (haplotypes)? Could I retrieve points of SV for my reads from output of your tool?

Alteroldis avatar Mar 21 '24 12:03 Alteroldis

Hello @Alteroldis,

This is a very interesting question. cuteSV can identify the breakend which enrolled in two different chromosomes or different haplotypes of homologous chromosomes. Also, cuteSV can report the read ID that supports the breakend event. So I guess cuteSV can help your purpose in this circumstance. You can use minimap2 to align long-reads to the diploid genome, and then run cuteSV.

Best, Tao

tjiangHIT avatar Mar 22 '24 02:03 tjiangHIT

Dear Dr Jiang, thank you for quick answer. I think it makes sense to remove reads that give exactly translocation events, and use the remaining ones for assembly. But, since I have both haplotypes in the assembly and it is unknown which two contigs belong to homologous chromosomes, a problem arises. Let's say reads 1-10 support translocations between contigs A and B. Then there will be another translocation event between contigs B and A with reads 11-21. Is everything correct? By deleting reads 1-21, I will lose part of the genome. And it seemed strange to me that there was a huge translocation event in the logs, but only about 2000 remained in vcf. Perhaps it’s worth tweaking some settings?

Alteroldis avatar Mar 24 '24 14:03 Alteroldis