What is the rationale of jump in contigs.paths?

Open Martin-lc opened this issue 6 months ago • 3 comments

Description of bug

Hi there, I have a question:

metaSPAdes produces a 'contigs.paths' file, which lists the path of graph segments for each contig. Consecutive segments are connected either by links (denoted by a comma between segments) or by jumps (denoted by a semicolon between segments).

My understanding is that a jump is used to place multiple contigs on one scaffold: the contigs on the scaffold are connected to each other via jumps.

So I expect jumps in 'scaffolds.paths', but why are there jumps in 'contigs.paths' as well, which should represent solely contigs?
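To make the question concrete, here is a minimal Python sketch that lists which entries in a paths file contain jumps. It rests on assumptions that are not confirmed in this thread: each record starts with a header line beginning with `NODE_`, the following lines hold the segment path, and a semicolon (a jump) may appear either inside a line or at the end of a line with the path continuing on the next line. The function names `parse_paths` and `paths_with_jumps` and the sample data are hypothetical and only serve as illustration.

```python
def parse_paths(text):
    """Map each header line to its list of gap-free segment runs.

    Assumption: header lines start with "NODE_"; all other non-empty
    lines belong to the path of the most recent header.
    """
    bodies = {}
    name = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith("NODE_"):
            name = line
            bodies[name] = ""
        elif name is not None:
            # Concatenate continuation lines so ';' splitting works
            # whether or not the path was wrapped after a jump.
            bodies[name] += line
    return {
        n: [run.split(",") for run in body.split(";") if run]
        for n, body in bodies.items()
    }


def paths_with_jumps(parsed):
    """Return {header: number_of_jumps} for paths with at least one jump."""
    return {n: len(runs) - 1 for n, runs in parsed.items() if len(runs) > 1}


# Made-up sample in the assumed layout (not taken from a real assembly).
example = """\
NODE_1_length_500_cov_10.0
1+,2-,3+
NODE_1_length_500_cov_10.0'
3-,2+,1-
NODE_2_length_900_cov_7.5
4+,5+;
6-,7+
"""

if __name__ == "__main__":
    for header, jumps in paths_with_jumps(parse_paths(example)).items():
        print(header, jumps)
```

Running the same two functions on the contents of both 'contigs.paths' and 'scaffolds.paths' would show whether jumps appear in one file or in both.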

spades.log

params.txt

SPAdes version

SPAdes v4.0.0

Operating System

Red Hat Enterprise Linux 9.0

Python Version

Python 3.13.0

Method of SPAdes installation

conda

No errors reported in spades.log

  • [x] Yes

Martin-lc avatar Jun 10 '25 00:06 Martin-lc

You're right, this seems to be some kind of issue. Is it possible for you to share a reproducer with us?

asl avatar Jun 10 '25 18:06 asl

Hi there, thank you for the response. Can you specify what you would like me to share?

The parameters and settings are given above in params.txt. In my experience working with multiple read files, jumps in contigs.paths are very common.

One of the read files can be downloaded here:

wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/ERR154/000/ERR1543540/ERR1543540_1.fastq.gz
wget -nc ftp://ftp.sra.ebi.ac.uk/vol1/fastq/ERR154/000/ERR1543540/ERR1543540_2.fastq.gz

Martin-lc avatar Jul 09 '25 18:07 Martin-lc

Do you think the issue is that contigs.fasta erroneously includes these jump-containing contigs (which should be in scaffolds.fasta instead)? Or should there be no jumps between those segments to begin with?

Martin-lc avatar Jul 09 '25 18:07 Martin-lc