bigbacter-nf
bigbacter-nf copied to clipboard
Bacterial surveillance pipeline.
BigBacter
Overview
BigBacter is a pipeline aimed at simplifying bacterial genomic surveillance.
Main features
- Saves your samples to a personal database (can be stored locally or on the cloud)
- Includes database samples when they are closely related to new samples
- Optimized to reduce core genome shrinkage
- Optimized for speed and resource usage
- Multiple species/subtypes can be included in a single run
- Generates report-ready figures
Main analyses:
- Recombination-aware core SNP analysis (Snippy and Gubbins)
- Accessory distance analysis (PopPUNK)
- Phylogenetic analysis (IQTREE2 or Rapidnj)
Main outputs:
- Phylogenetic trees (Core SNPs)
- Distance matrices (Accessory distance and Core SNPs)
- Tabulated summary (QC metrics, linkage summaries, and more!)
- Outputs are summarized in Microreact files 🙌
Required Inputs
| Column Name | Description |
|---|---|
| sample | sample name |
| taxa | sample taxonomy (species or lower) |
| assembly | assembly file |
| fastq_1 | foward read |
| fastq_2 | reverse read |
BigBacter also requires a taxon-specific PopPUNK database (pre-made databases can be found here). (https://www.bacpop.org/poppunk/))
[!TIP] BigBacter is designed to run after general bacterial analysis (e.g, PHoeNIx, Bactopia, TheiaProk.)
How to use BigBacter:
Checkout the wiki to learn more!
BigBacter was originally created by Jared Johnson for the Washington State Department of Health. See a full list of contributors here.