LazyB
LazyB copied to clipboard
Economic Genome Assembly from Low Coverage Illumina and Nanopore Data
MuCHSALSA
For an efficient implementation of this prototype please visit: https://github.com/0x002A/MuCHSALSA/
Lazy B
The Bradypus is a lazy but very effective animal.

Introduction
This is a preliminary version of the tool under rapid development.
Gatter, Thomas, et al. "Economic genome assembly from low coverage Illumina and Nanopore data." 20th International Workshop on Algorithms in Bioinformatics (WABI 2020). Schloss Dagstuhl-Leibniz-Zentrum für Informatik, 2020.
Please contact us with any issues and suggestions.
Requirements
Following tools must be installed:
- jellyfish 2
- bbduk
- Abyss 2 (using 'abyss-pe')
- minimap2
- modified version of racon (racon_mod in this project, compiled for generic linux)
Usage
call the script lazyB_pipeline.sh as follows:
/path/lazyB\_pipeline.sh \[k-mer-size-filter\] \[k-mer-size-assembly\] \[name\] \[illumina-inputfile-1\] \[illumina-inputfile-2\]] \[nanopore-inputfile\] \[output-folder\]
example:
/path/lazyB\_pipeline.sh 50 90 Scerevisiae illumina\_paired\_end\_1.fa illumina\_paired\_end\_2.fa nanopore\_data.fastq output-folder
Note: Parts of the pipeline run on 8 cores. To change the number of available cores, set the value manually in lazyB_pipeline.sh.
[k-mer-size-filter] specifies the k-mer size for k-mer counting in raw illumina data. Reads with highly abundant k-mers are removed from the data. Starting at k=50 is recommended.
[k-mer-size-assembly] specifies the k-mer size during illumina assembly (here using Abyss). Starting at k=90 is recommended.
Comment out the following line in lazyB_pipeline.sh to disable the experimental polishing step.
$SCRIPTPATH/racon_prokrast -u -t $CORES $TMP/temp_1.query.fa $TMP/temp_1.align.paf $TMP/temp_1.target.fa > $OUT/03.assembly.fa