Should I put option "--careful"?

Open WeiWei1112 opened this issue 7 years ago • 4 comments

Hi, I want to use SPAdes to assemble a ~40 kb plasmid sequenced on an Illumina MiSeq (2 x 250 nt). When I use the default options:

    spades.py -1 ../BWA/B13_vector_unmapped.R1.fastq -2 ../BWA/B13_vector_unmapped.R2.fastq --phred-offset 33 -o B13

I get several small contigs, like the ones below:

>NODE_1_length_10836_cov_178.206088
>NODE_2_length_9772_cov_179.468326
>NODE_3_length_3195_cov_183.105606
>NODE_4_length_2559_cov_173.407072
>NODE_5_length_1809_cov_165.414388
>NODE_6_length_1761_cov_177.950428
>NODE_7_length_954_cov_175.383313
>NODE_8_length_789_cov_382.403323
>NODE_9_length_715_cov_332.863946
>NODE_10_length_666_cov_357.298701

But if I add the "--careful" option, I get:

>B13_GGCTTAAG-TCGTGACC_1_length_34210_cov_184.621189

The result is apparently much better with the "--careful" option. My question is: do I risk throwing away true variants if mismatches and indels are corrected? My sequence contains a gene cluster whose genes might be similar to, but slightly different from, each other. Do you have any suggestions for the option settings? Thank you so much!

Wei Wei

WeiWei1112 avatar Oct 01 '18 23:10 WeiWei1112
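
One way to compare the two runs is to read the length and coverage directly out of the contig names quoted above. The sketch below is not part of SPAdes: `parse_header`, `summarize`, and the 1.5x-median cutoff are illustrative assumptions. It relies only on the `_length_<n>_cov_<x>` suffix visible in the headers. Coverage well above the median is often taken as a hint of a collapsed repeat, but that is a heuristic, not a diagnosis.

```python
import re
import statistics

# Matches the "<name>_length_<n>_cov_<x>" pattern seen in the headers above.
HEADER_RE = re.compile(
    r"^>?(?P<name>.+)_length_(?P<length>\d+)_cov_(?P<cov>\d+(?:\.\d+)?)$"
)


def parse_header(header):
    """Return (name, length, coverage) parsed from one contig header."""
    match = HEADER_RE.match(header.strip())
    if match is None:
        raise ValueError(f"unrecognised contig header: {header!r}")
    return match.group("name"), int(match.group("length")), float(match.group("cov"))


def summarize(headers, factor=1.5):
    """Total length, median coverage, and contigs above factor * median.

    The factor of 1.5 is an arbitrary illustrative cutoff (assumption).
    """
    records = [parse_header(h) for h in headers]
    median_cov = statistics.median(cov for _, _, cov in records)
    return {
        "total_length": sum(length for _, length, _ in records),
        "median_cov": median_cov,
        "high_cov": [name for name, _, cov in records if cov > factor * median_cov],
    }


# Headers copied from the default (non-careful) run in this thread.
DEFAULT_RUN = [
    ">NODE_1_length_10836_cov_178.206088",
    ">NODE_2_length_9772_cov_179.468326",
    ">NODE_3_length_3195_cov_183.105606",
    ">NODE_4_length_2559_cov_173.407072",
    ">NODE_5_length_1809_cov_165.414388",
    ">NODE_6_length_1761_cov_177.950428",
    ">NODE_7_length_954_cov_175.383313",
    ">NODE_8_length_789_cov_382.403323",
    ">NODE_9_length_715_cov_332.863946",
    ">NODE_10_length_666_cov_357.298701",
]

if __name__ == "__main__":
    print(summarize(DEFAULT_RUN))
```

On the ten headers listed here, the three shortest contigs (NODE_8 to NODE_10) sit at roughly twice the coverage of the rest, which would be consistent with sequence present in more than one copy, such as similar genes in a cluster.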

You may want to check the assembly graph to see what is around those contigs and why they were not assembled.

asl avatar Oct 16 '18 18:10 asl
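
As a rough illustration of what checking the graph could look like without a viewer, the sketch below lists each segment of a GFA 1 file with its length and neighbours. It assumes the graph is available in GFA format (to my knowledge SPAdes writes one as `assembly_graph_with_scaffolds.gfa` in the output directory, but please verify for your version). `read_gfa` and `describe` are hypothetical helper names, and the toy graph is made up for the example.

```python
from collections import defaultdict


def read_gfa(lines):
    """Parse GFA 1 'S' (segment) and 'L' (link) records.

    Returns (lengths, links): segment name -> length, and
    segment name -> set of directly linked segment names.
    Orientation and overlap fields are ignored in this sketch.
    """
    lengths = {}
    links = defaultdict(set)
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if fields[0] == "S" and len(fields) >= 3:
            name, seq = fields[1], fields[2]
            if seq != "*":
                lengths[name] = len(seq)
            else:
                # Sequence omitted: fall back to the optional LN:i: tag.
                ln_tags = [f for f in fields[3:] if f.startswith("LN:i:")]
                lengths[name] = int(ln_tags[0][5:]) if ln_tags else 0
        elif fields[0] == "L" and len(fields) >= 5:
            left, right = fields[1], fields[3]
            links[left].add(right)
            links[right].add(left)
    return lengths, links


def describe(lengths, links):
    """Return (name, length, sorted neighbours) for every segment."""
    return [(name, lengths[name], sorted(links.get(name, ()))) for name in sorted(lengths)]


# Made-up toy graph: segment "2" is shared between "1" and "3".
TOY_GFA = [
    "H\tVN:Z:1.0",
    "S\t1\tACGTACGT",
    "S\t2\tACG",
    "S\t3\tTTTT",
    "L\t1\t+\t2\t+\t2M",
    "L\t2\t+\t3\t-\t2M",
]

if __name__ == "__main__":
    for row in describe(*read_gfa(TOY_GFA)):
        print(row)
    # For a real run (path is an assumption):
    # with open("B13/assembly_graph_with_scaffolds.gfa") as handle:
    #     for row in describe(*read_gfa(handle)):
    #         print(row)
```

Short segments with several neighbours are the usual places where an assembler could not choose a single path, so they are a reasonable first thing to look at.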

You may want to check the assembly graph to see what is around those contigs and why they were not assembled.

Thanks for your reply! Do you know what the "--careful" option does?

WeiWei1112 avatar Oct 29 '18 15:10 WeiWei1112

Sure. Per SPAdes manual (http://cab.spbu.ru/files/release3.13.0/manual.html):

--careful
    Tries to reduce the number of mismatches and short indels. Also runs MismatchCorrector – a post processing tool, which uses BWA tool (comes with SPAdes). This option is recommended only for assembly of small genomes. We strongly recommend not to use it for large and medium-size eukaryotic genomes

asl avatar Jan 11 '19 13:01 asl

Hello, if I'm assembling MDA data, is it better to use the --careful option?

ZhaoruiZhou avatar Aug 19 '24 02:08 ZhaoruiZhou