diamond icon indicating copy to clipboard operation
diamond copied to clipboard

low RAM & CPU efficiency on slurm

Open schraderL opened this issue 3 years ago • 1 comments

Hi, I am running diamond blastx on some individual eukaryotic scaffolds on a HPC environment with slurm with the following settings:

diamond blastx \
        --query ${assembly} \
        --db uniprot/reference_proteomes.dmnd \
        --outfmt 6 qseqid staxids bitscore qseqid sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore \
        --fast \
        --max-target-seqs 1 \
        --evalue 1e-25 \
        --threads ${threads} \
        > ${assembly}.diamond.blastx.out

I have run this with 36 threads and 90 GB of RAM. However, CPU & RAM efficiency are both less than 5 % according to slurm:

Nodes: 1 Cores per node: 36 CPU Utilized: 02:06:08 CPU Efficiency: 2.91% of 3-00:10:12 core-walltime Job Wall-clock time: 02:00:17 Memory Utilized: 4.20 GB Memory Efficiency: 4.66% of 90.00 GB

Is there a way to improve this run so that diamond can more effectively use the resources available?

Thanks! Lukas

schraderL avatar Jul 08 '22 08:07 schraderL

The problem is probably the long input sequences which is not efficient in regular blastx mode, try setting -F 15. Additionally I'd recommend -b4 -c1.

bbuchfink avatar Jul 08 '22 08:07 bbuchfink