racon icon indicating copy to clipboard operation
racon copied to clipboard

Polish noisy long reads with hifi long reads

Open TesiNicco opened this issue 5 years ago • 4 comments

Hi,

I am relatively new to racon. I am trying to polish noisy long reads (pacbio) using hi-fidelity long reads (pacbio).

I am using pbmm2 (SMRT C++ wrapper for minimap2's C API) for alignment: pbmm2 align hifi_reads.fa reads_to_correct.fa reads_to_correct_aligned.bam The resulting sam file contains alignments.

Then I would use racon to polish the reads: racon hifi_reads.fa reads_to_correct_aligned.sam reads_to_correct.fa

The error I got is: [racon::Polisher::initialize] error: empty overlap set!

From what I understood, the way the files are specified in the racon command should be correct. Any idea what can be the cause of this?

Additional details: reads for correction.fa = 2 high quality reads (quality ~99%) reads to be corrected.fa = 6 noisy reads (quality ~80%) All the reads align to the same portion of the reference genome.

Thanks in advance

TesiNicco avatar Dec 05 '20 19:12 TesiNicco

Hi Niccolo, you have to reorder HiFi and CLR reads in your pbmm2 command (first CLR reads which are the "reference" on which you map the HiFi reads). As you have 2 HiFi vs 6 CLR reads, use parameter -f when running Racon, so that it uses all found alignments. Otherwise, it will use only the longest alignment per HiFi read and the majority of the CLR reads will not be polished.

Best regards, Robert

rvaser avatar Dec 06 '20 12:12 rvaser

Hi Robert,

I have a question about polishing my data. I have sequenced 150× data for my plant using Illumina hiseq (150bp × 2). I am also sequencing my plant with the Pacbio Hifi (ccs) method. According to this paper 'Accurate circular consensus long-read sequencing improves variant detection and assembly of a human genome', the accuracy of the HIFI is about 99.8 percent. I am wondering if it still needs to use the Hiseq data to polish the assembly (correct the base errors) from the HIFI reads.

I appreciate your help.

Best,

Lingyun

lychen83 avatar Dec 24 '20 01:12 lychen83

Hi Lingyun, I have not evaluated HiFi vs Illumina polishing so far, so I cannot advise you. You could first polish with HiFi, and afterwards with Illumina, and evaluate the accuracy of both steps.

Best regards, Robert

rvaser avatar Dec 25 '20 01:12 rvaser

HI @rvaser,

I try to use HIFI reads polish ONT reads by racon, but I failed.

[racon::Polisher::initialize] error: empty overlap set!

It is the command.

racon hifi.small.fq small.paf UL.small.fq

And it is the file.

UL.small.fq.txt hifi.small.fq.txt small.paf.txt

Can you give me some advise?

Best regards, Hang.

socialhang avatar Mar 25 '23 08:03 socialhang