hifiasm icon indicating copy to clipboard operation
hifiasm copied to clipboard

How to deal with extremely high mapping rate?

Open oddguyeee opened this issue 1 year ago • 3 comments

I use hifiasm with HiFi reads to construct primary contigs, and scaffolding with 3ddna pipeline. Although I obtain a relatively complete genome, with 92.9% of busco and 95% of primary contigs, the extremely high mapping rate was found by mapping HiFi reads to final assembly. According to log file, the homozygous and heterozygous read coverage threshold were 36 and 18, respectively, as showed in k-mer plot. How can I reset the assembly parameters? image

oddguyeee avatar Aug 31 '24 08:08 oddguyeee

Those could be centromeric/satellite repeat regions. If so, the higher mapping rate is expected. Use tools like Flagger/Inspector to check the collapsed & duplicated regions of the assembly.

kiratalreja3 avatar Sep 02 '24 00:09 kiratalreja3

Thank you @kiratalreja3 ,

In addition, I have another question that the draft assembly from hifiasm with default parameters show a relatively low busco, 95%, How I can I improve this index?

oddguyeee avatar Sep 02 '24 07:09 oddguyeee

If you have PacBio subreads data, run DeepConsensus for error-correction before assembly with hifiasm. Also make sure you run adapter trimming. After the assembly is done, you can use Inspector/similar tools to perform error correction. Then, after 3DDNA - consider using a gap filling tool like TGS GapCloser to fill the scaffolding gaps. These are some strategies that I use.

kiratalreja3 avatar Sep 02 '24 23:09 kiratalreja3