minimap2 icon indicating copy to clipboard operation
minimap2 copied to clipboard

No output from minimap when generating the index first.

Open dosshra opened this issue 3 years ago • 2 comments

Hello I am trying to use minimap to align genome assembly to a reference genome. Since the genome is very large, (~10GB) I have split the new genome assembly to 2500 contig files and run it in parallel on HPC. To save the time of indexing, I first built a mmi index. These are the commands I used:

minimap2 -x asm20 -t 3 -d ref.mmi ref.fas
minimap2 -ax asm20 -t 20 ref.mmi contig.fas >contig.sam

This is the log I get:

Mon Mar  1 17:25:18 IST 2021
[WARNING]ESC[1;31m For a multi-part index, no @SQ lines will be outputted. Please use --split-prefix.ESC[0m
[M::main::36.784*0.69] loaded/built the index for 6 target sequence(s)
[M::mm_mapopt_update::42.710*0.73] mid_occ = 1347
[M::mm_idx_stat] kmer size: 19; skip: 10; is_hpc: 0; #seq: 6
[M::mm_idx_stat::46.892*0.75] distinct minimizers: 210756922 (76.44% are singletons); average occurrences: 3.738; average spacing: 5.689
Mon Mar  1 17:26:12 IST 2021

The contig.sam file is empty. Runing minimap without the indexing step is giving the expected SAM file. Thank you

dosshra avatar Mar 02 '21 05:03 dosshra

There should be other messages. It is probably killed by your job scheduler. Increase the memory limit.

lh3 avatar Mar 02 '21 13:03 lh3

Thank you for the response. I am working on ~10GB reference genome with ~1Mb contigs. I am using 120GB RAM. This configuration works well when I do not do the indexing first.

dosshra avatar Mar 03 '21 14:03 dosshra