minidot
minidot copied to clipboard
Fast and pretty dotplots for whole genomes assemblies using minimap and R/ggplot2
** minidot
[[etc/header.png]]
Quickly produce pretty dotplots from [[https://github.com/lh3/minimap][minimap]] mappings using R/ggplot2.
**** Dependencies
- R packages: ggplot2, stringr, scales, argparse
- curl, gzip (for samples only)
**** Install #+BEGIN_SRC git clone https://github.com/thackl/minidot.git #+END_SRC
**** Samples #+BEGIN_SRC make sample-prochlorococcus make sample-arabidopsis #+END_SRC
Make sample commands will download required data, set up minimap (if not already in $PATH) and run minidot.
***** Prochlorococcus ecotypes #+BEGIN_SRC
actual minidot command invoked by make sample-prochlorococcus after setting up the data
minidot -s -o prochlorococcus.pdf MED4.da SB.fa NATLA2.fa LG.fa MIT9313.fa #+END_SRC
Quickly compare genome organization among multiple bacterial genomes (finished and draft).
[[etc/prochlorococcus.png]]
***** /A. thaliana/ vs /A. lyrata/ - compare two related eukaryotes #+BEGIN_SRC
actual minidot command invoked by make sample-arabidopsis after setting up the data
minidot -o arabidopsis.pdf A.lyrata.fa A.thaliana.fa #+END_SRC
Compare two related eukaryotic genomes. Larger genomes are automatically run in =fast= mode, which is less sensitive and looks only for larger stretches of similarity (>500bp).
For /A.t./ vs /A.l./, mapping & plotting takes about 1.5 minutes (desktop PC, 3 threads, 8GB RAM). That's probably faster than downloading the genomes in the first place...
[[etc/arabidopsis.png]]
**** Contact Any kind of feedback is highly appreciated. Feel free to either report [[https://github.com/thackl/minidot/issues/new][issues]] directly via github or shoot me an email.