verkko icon indicating copy to clipboard operation
verkko copied to clipboard

Producing dual assemblies when no trio or Hi-C data is available

Open smkumaill opened this issue 1 year ago • 2 comments

When there is no trio data available, is it possible to produce hifiasm style pseudo-haplotype resolved assemblies?

I was not able to produce the pseudo-haplotypes from the final assembly. Do you have any suggestion as to what tool or method could be utilized to produce these pseudo-haplotypes (if at all possible).

smkumaill avatar Jun 16 '23 11:06 smkumaill

There is currently no unphased output from verkko. However, the combination of hifi and ONT data produces much longer phase blocks than hifi alone (e.g. megabases vs kps on human). The size of the blocks will vary depending on the heterozygosity of the sample. I would suggest looking at the output graph to see how much connectivity remains in your sample due to large homozygous regions. If there are almost none, you can do a simple purge to get a single haplotype on the final assembly (using purge_dups). If not, the only current option would be to provide paths manually through the graph.

skoren avatar Jun 16 '23 15:06 skoren

We've discussed producing a primary/alt style output so I've added the enhancement tag and will keep this open as part of future development.

skoren avatar Jun 23 '23 14:06 skoren