mag icon indicating copy to clipboard operation
mag copied to clipboard

GTDB-TK fails due to insufficient memory

Open nayeimkhan opened this issue 3 years ago • 3 comments

Description of the bug

Hi, I have been trying to run the MAG pipeline with the latest version. However, the pipeline is failing at the GTDBTK_Classify step due to insufficient memory. Is there a workaround around the memory or is there a way to bypass the GTDB step? Any suggestions here will be helpful. Thanks!

Command used and terminal output

Command used - "nextflow run nf-core/mag -profile docker --input "$(pwd)/*_R{1,2}_001.fastq.gz" --outdir test_1 --skip_spades -r 2.2.1 --binning_map_mode own --skip_maxbin2 --skip_prokka --skip_prodigal --keep_phix true" 

Output
-[nf-core/mag] Pipeline completed with errors-
Error executing process > 'NFCORE_MAG:MAG:GTDBTK:GTDBTK_CLASSIFY '

Caused by:
  Process requirement exceed available memory -- req: 128 GB; avail: 123.2 GB

Relevant files

No response

System information

No response

nayeimkhan avatar Sep 07 '22 20:09 nayeimkhan

Hi @nayeimkhan , you can use --gtdb false to skip the GTDB classification.

skrakau avatar Sep 08 '22 06:09 skrakau

Thanks @skrakau for your suggestion!

nayeimkhan avatar Sep 10 '22 18:09 nayeimkhan

Newer versions of GTDB-Tk require significantly less memory than the one currently available in mag. See https://github.com/Ecogenomics/GTDBTk#new-features

This reduces the memory requirements of GTDB-Tk from 320 GB of RAM when using the full GTDB R07-RS207 reference tree to approximately 55 GB.

maxibor avatar Oct 19 '22 09:10 maxibor

I close this in favor of https://github.com/nf-core/mag/issues/409

d4straub avatar May 09 '23 11:05 d4straub