MitoFinder icon indicating copy to clipboard operation
MitoFinder copied to clipboard

Not finding the mito in assembled CLR data

Open aureliendejode opened this issue 1 year ago • 3 comments

Hello,

I tried to find the mitochondrial sequences in my assembly. I assembled CLR PacBio reads using CANU for a mollusc genome. As reference I used 2 mitochondrial genomes of the same species and I used the genetic code 5 (Mito Invertebrates), but Mitofinder said that it could not find mitochondrial sequence in contigs less than 25 000 bp.

Does anyone know what is happening ? Is it better to use MitoFinder on the reads for that particular case ?

Thanks for your help

Aurélien

aureliendejode avatar Mar 01 '23 20:03 aureliendejode

Hello Aurélien,

There are two possibilities here. Either CANU discarded the mitochondrial sequences due to their suspicious coverage (i.e. really high compared to genomic sequences), or some chimeric sequences have been created by CANU. In that case, you can allow MitoFinder to search for longer sequences (--max-contig-size option) and see if several mitogenomes have been concatenated during the assembly step. Unfortunately, MitoFinder is not designed yet to handle long-reads data. Starting from reads is therefore not possible in your case. You can try to assemble your reads with an alternative assembler ...

Sorry for the inconvenience, Best regards, Rémi

RemiAllio avatar Mar 01 '23 20:03 RemiAllio

Hello Rémi,

Thanks for this I was able to fetch the contigs but MitoFinder in the raw CANU assembly. My guess is that it was probably eliminated by purge_dups because it has a filter on coverage. however, MitoFinder could not circularize it. I there something i can do to help the circularization ?

Aurélien

aureliendejode avatar Apr 25 '23 20:04 aureliendejode

Hello Rémi,

I was wondering if you would have any insights about why MitoFinder was unable to circularize the mitochondrial dna ?

Best Aurélien

aureliendejode avatar Oct 11 '23 16:10 aureliendejode