spades icon indicating copy to clipboard operation
spades copied to clipboard

SPAdes scaffolds assembly missing (skipping)

Open luke831215 opened this issue 6 years ago • 13 comments

Hi all,

I'm using paired-end assembly on GAGE-b MiSeq dataset. (--pe-1 r1.fastq --p2-2 r2.fastq)

In the log file, I found "Skipping processing of scaffolds (empty file)" and the output therefore missed scaffolds.fasta but there was configs.fasta. Any clue if it is a memory issue or anything else?

I use veresion 3.11.1 and set num_thread to 50 and memory limit to 1024 GB and run on a machine with 96 cores and 1.5TB RAM.

Any help is appreciated!

image

luke831215 avatar Nov 06 '17 08:11 luke831215

Hello

Please provide us your spades.log file

asl avatar Nov 06 '17 10:11 asl

This smells like an I/O issue. Does the problem reproduce if you'd simply restart everything from scratch?

asl avatar Nov 09 '17 11:11 asl

Yes, it seems so with the same input and parameters. According to our discussion, could it mean it is almost impossible to produce scaffolds from the assembled contigs? This way, we could simply treat it as a scaffold that contains only a single contig?

luke831215 avatar Nov 11 '17 17:11 luke831215

According to our discussion, could it mean it is almost impossible to produce scaffolds from the assembled contigs?

Well, the situation with SPAdes is quite the opposite. contigs are generated from scaffolds, so the files are always there and contigs should be equal to scaffolds in this case. So what you're seeing is an indication of real issue.

Will it be possible for your to share your data with us so we could reproduce and fix the issue?

asl avatar Nov 13 '17 11:11 asl

Glad to share!

params.txt spades.log Dataset downloaded from SRA: fastq-dump --split-files SRR522246 Reference genome: https://goo.gl/8X3qCj I use QUAST gage mode as benchmark tools.

luke831215 avatar Nov 13 '17 17:11 luke831215

Your second spades.log clearly shows that the scaffolds.fasta are there and processed:

== Processing of scaffolds


== Running contig polishing tool: /home/luke831215/tools/SPAdes-3.11.1-Linux/bin/corrector /home/ph/mnt/biocollab/GAGE-B/b-sides/R_sphaeroides_MiSeq/assembly/mismatch_corrector/scaffolds/configs/corrector.info /home/ph/mnt/biocollab/GAGE-B/b-sides/R_sphaeroides_MiSeq/assembly/misc/assembled_scaffolds.fasta


== Dataset description file was created: /home/ph/mnt/biocollab/GAGE-B/b-sides/R_sphaeroides_MiSeq/assembly/mismatch_corrector/scaffolds/configs/corrector.info

Also, the second dataset is different from the first one. We need the one where the issue could be reproduced. We're certainly having the R.sphaeroides dataset from GAGE-B and everything is ok there.

asl avatar Nov 13 '17 20:11 asl

Sorry, this should be right. params.txt spades.log Link for input reads: https://goo.gl/5wevh4

The reads here are the subset (93.4%) of R.sphaeroides MiSeq dataset. Please note that I don't input single reads using --pe1-s here. Adding up the single reads does produce scaffolds.fasta for this subset, but not in all other cases.

luke831215 avatar Nov 14 '17 07:11 luke831215

Sorry, this should be right.

Doesn't seem so. The log has:

== Processing of scaffolds


== Running contig polishing tool: /home/luke831215/tools/SPAdes-3.11.1-Linux/bin/corrector /home/ph/mnt/biocollab/GAGE-B/b-sides/R_sphaeroides_MiSeq/assembly/mismatch_corrector/scaffolds/configs/corrector.info /home/ph/mnt/biocollab/GAGE-B/b-sides/R_sphaeroides_MiSeq/assembly/misc/assembled_scaffolds.fasta


== Dataset description file was created: /home/ph/mnt/biocollab/GAGE-B/b-sides/R_sphaeroides_MiSeq/assembly/mismatch_corrector/scaffolds/configs/corrector.info

asl avatar Nov 14 '17 08:11 asl

2017-11-14_163229

This is what I saw. Please check the new log file uploaded.

luke831215 avatar Nov 14 '17 08:11 luke831215

Hi Luke Did you resolve the problem? I have the exact problem, but only if i use --careful parameter

ChQuinteroC avatar Apr 06 '20 19:04 ChQuinteroC

Hi guys, I have the same problem like ChQuinteroC I only use --careful

Keyth-ru avatar Oct 27 '20 06:10 Keyth-ru

I have installed spades, I am unable to produce the scaffolds while running either with "spades.py --test" nor with spades.py -1 sample_1P.fastq.gz -2 sample_2P.fastq.gz -o spades_output. I am attaching params.txt and spades.log. Kindly help with solution. spades.log params.txt

ShobhitaShah avatar May 29 '24 07:05 ShobhitaShah