
Including low-coverage sequences causes nullarbor to exit prematurely without an error message

Open kwongj opened this issue 9 years ago • 6 comments

Nullarbor seemed to stop during the assembly stage (SPAdes), and I couldn't see any error message. I found two sequences with low coverage (9x and 17x); when I re-ran nullarbor in the same folder using an edited Makefile that excluded these two sequences, it ran as expected.

kwongj avatar Feb 01 '16 01:02 kwongj

Garbage in, garbage out.
Megahit would probably do ok with it.
Not really a problem I can solve, and yes, Makefiles don't handle this well :(

tseemann avatar Jul 09 '17 11:07 tseemann

Perhaps add an error message if the estimated coverage is too low?

kwongj avatar Jul 09 '17 13:07 kwongj

@kwongj That would be good, but a little tricky because of the Makefile architecture. In fact, most pipelines struggle with "business logic" in the middle of things.

There are a couple of ways it might be handled.

When you say "error message", do you mean exit the pipeline or just print something?

tseemann avatar Jul 09 '17 23:07 tseemann

I think it would be reasonable to exit the pipeline, but at the very least, it would be nice to print something to stderr.

  • "the dream" : assemble genomes and perform typing, resistome etc. for those sequences with adequate depth, exclude sequences with low depth, and exit the pipeline prior to phylogenetic analysis
  • "the achievable" : print a warning to stderr that sequenceX and sequenceY have low sequencing depth, and optionally exit the pipeline.

kwongj avatar Jul 10 '17 11:07 kwongj
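
The two options above can be sketched in a few lines. This is an illustration only, not nullarbor code: the function names, the 20x default threshold and the 5 Mbp genome size in the usage note are all assumptions, and depth is approximated as total sequenced bases divided by genome size.

```python
import sys


def estimate_depth(total_bases, genome_size):
    """Approximate sequencing depth as sequenced bases / genome size."""
    return total_bases / genome_size


def partition_by_depth(total_bases_by_sample, genome_size, min_depth=20.0):
    """Split samples into (keep, low) lists and warn on stderr about low ones.

    min_depth=20.0 is an arbitrary illustrative cut-off, not a nullarbor default.
    """
    keep, low = [], []
    for name, bases in sorted(total_bases_by_sample.items()):
        depth = estimate_depth(bases, genome_size)
        if depth < min_depth:
            # "the achievable": at least tell the user which sample is the problem
            print(
                f"WARNING: {name} has low estimated depth "
                f"({depth:.1f}x < {min_depth:.1f}x)",
                file=sys.stderr,
            )
            low.append(name)
        else:
            keep.append(name)
    return keep, low
```

With a hypothetical 5 Mbp genome, samples carrying 45 Mbp and 85 Mbp of reads come out at 9x and 17x and land in the low list, while a 500 Mbp sample (100x) is kept. For "the dream", a caller would continue per-sample steps on the keep list and skip the phylogenetic analysis whenever the low list is non-empty.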

We have the same issue: nullarbor stopped at the assembly step, and we had to invoke make manually several times to complete the pipeline. Our coverage is over 100-fold. We tried running SPAdes manually, using both the cleaned reads generated by the trimmomatic step and the uncleaned reads, and both runs completed.

labrat9 avatar Aug 08 '17 02:08 labrat9

@labrat9 You can run make -k, which "keeps on going" past any failed targets and still builds whatever does not depend on them.

make itself has no capability to do "business logic". I would need to add in "helper scripts" which check things along the way and exit.

I should probably do this for coverage estimation at minimum.

tseemann avatar Aug 08 '17 05:08 tseemann
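
A helper script of the kind described above could look roughly like the following. It is a sketch under stated assumptions rather than anything shipped with nullarbor: the name check_depth.py, its argument order and its exit codes are hypothetical, and it assumes plain four-line FASTQ records (no wrapped sequence lines). The mechanism it relies on is standard make behaviour: a recipe command that exits non-zero fails its target, which stops the run unless make -k is in effect.

```python
import gzip
import sys


def count_bases(fastq_path):
    """Sum sequence lengths in a FASTQ file (optionally gzipped).

    Assumes strict 4-line records, so the sequence is every 2nd line of 4.
    """
    opener = gzip.open if fastq_path.endswith(".gz") else open
    total = 0
    with opener(fastq_path, "rt") as handle:
        for line_number, line in enumerate(handle):
            if line_number % 4 == 1:
                total += len(line.strip())
    return total


def main(argv):
    """Return 0 if depth is adequate, 1 if too low, 2 on a usage error."""
    if len(argv) < 4:
        print(
            "usage: check_depth.py GENOME_SIZE MIN_DEPTH READS.fq[.gz] ...",
            file=sys.stderr,
        )
        return 2
    genome_size = int(argv[1])
    min_depth = float(argv[2])
    total = sum(count_bases(path) for path in argv[3:])
    depth = total / genome_size
    if depth < min_depth:
        print(
            f"ERROR: estimated depth {depth:.1f}x is below {min_depth:.1f}x",
            file=sys.stderr,
        )
        return 1
    return 0

# To use this as a standalone script, end the file with:
#     sys.exit(main(sys.argv))
```

Called from a Makefile recipe ahead of the assembly command, a non-zero return would halt that sample's target with a visible message on stderr instead of a silent stop.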