miniprot

An error when looking for 200 proteins in a genome

Open gilloe opened this issue 1 year ago • 6 comments

Hi, I get the following error when running miniprot to search for 200 proteins in a 73 MB genome:

[M::[email protected]1.01] read 68163509 bases in 69053 contigs
[M::[email protected]1.01] 608218 blocks
[M::[email protected]2.83] collected syncmers
[M::[email protected]1.13] 23643989 kmer-block pairs
[M::mp_idx_print_stat] 1694743 distinct k-mers; mean occ of infrequent k-mers: 13.95; 0 frequent k-mers accounting for 0 occurrences
##gff-version 3
BUG! 1929 == 1929? 621 == 622? 40M1I57M3D83M2D98M3I8M1I59M1D26M15U117M1D46M2D30M5I27M1I5M63U12M
miniprot_: align.c:195: mp_extra_cal: Assertion `al == r->qe - r->qs' failed.
Aborted
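
For anyone trying to read the BUG line: the numbers appear to compare lengths recomputed from the CIGAR string against the recorded alignment spans, and the protein side is off by one (621 vs. 622), which is what trips the `al == r->qe - r->qs` assertion. The sketch below recomputes both totals from the CIGAR in the log. The per-operation rules are an assumption inferred from these numbers, not taken from the miniprot source: `M` consumes 3 nt and 1 residue per unit, `I` consumes residues only, `D` consumes 3 nt per unit, and `U` consumes its length in nt plus a single residue. The function name `consumed_lengths` is hypothetical.

```python
import re

# One CIGAR operation: a count followed by an operator letter.
# Only the four operators that occur in the logged CIGAR are handled.
CIGAR_OP = re.compile(r"(\d+)([MIDU])")


def consumed_lengths(cigar):
    """Return (nucleotides, residues) consumed by a protein-to-genome CIGAR.

    The consumption rules are assumptions inferred from the log output.
    """
    nt = aa = 0
    pos = 0
    for m in CIGAR_OP.finditer(cigar):
        if m.start() != pos:
            raise ValueError(f"unsupported CIGAR text at offset {pos}")
        pos = m.end()
        n, op = int(m.group(1)), m.group(2)
        if op == "M":    # aligned residues: 3 nt per residue
            nt += 3 * n
            aa += n
        elif op == "I":  # residues present only in the protein
            aa += n
        elif op == "D":  # codons present only in the genome
            nt += 3 * n
        elif op == "U":  # assumed: n nt on the genome, one residue on the protein
            nt += n
            aa += 1
    if pos != len(cigar):
        raise ValueError(f"unsupported CIGAR text at offset {pos}")
    return nt, aa


cigar = ("40M1I57M3D83M2D98M3I8M1I59M1D26M15U117M1D46M2D"
         "30M5I27M1I5M63U12M")
print(consumed_lengths(cigar))  # (1929, 621)
```

Under these assumed rules the CIGAR reproduces both left-hand numbers in the BUG line, so the alignment covers one residue fewer than the recorded query span.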

I ran it with 20 proteins against the same genome and it worked fine, and I also ran the 200 proteins against a smaller genome, which worked fine too. So I don't think it is a file format issue. Any suggestions?

Thanks, Gil

gilloe avatar Jun 06 '23 10:06 gilloe

Could you share the proteins and the reference genome with me?

lh3 avatar Jun 06 '23 13:06 lh3

Yes, thanks. The genome can be downloaded from https://www.ncbi.nlm.nih.gov/nuccore/LDNA00000000.1 and the query file is attached: Hydra_uniprot (2).zip

gilloe avatar Jun 07 '23 08:06 gilloe

I downloaded the genome and was able to get results:

[M::[email protected]*1.00] read 68163509 bases in 69053 contigs
[M::[email protected]*1.00] 608218 blocks
[M::[email protected]*2.52] collected syncmers
[M::[email protected]*2.09] 23643989 kmer-block pairs
[M::mp_idx_print_stat] 1694743 distinct k-mers; mean occ of infrequent k-mers: 13.95; 0 frequent k-mers accounting for 0 occurrences
[M::worker_pipeline::1.299*2.27] mapped 200 sequences
[M::main] Version: 0.11-r234
[M::main] CMD: ./miniprot --gff GCA_001455295.2_ASM145529v2_genomic.fna.gz Hydra_uniprot.fasta

Based on the total number of bases, we are using the same reference.

What version are you using?

lh3 avatar Jun 08 '23 02:06 lh3

How do I know which version I have?

gilloe avatar Jun 08 '23 14:06 gilloe

miniprot --version. You may just try the latest version.

lh3 avatar Jun 08 '23 15:06 lh3

0.11-r235-dirty. I downloaded it very recently.

gilloe avatar Jun 08 '23 15:06 gilloe

Closing. Not sure what was happening, but if I can't reproduce the error, I won't be able to fix it.

lh3 avatar Sep 30 '24 19:09 lh3