sourmash
sourmash copied to clipboard
Quickly search, compare, and analyze genomic and metagenomic data sets.
per https://github.com/AllTheBacteria/AllTheBacteria/issues/28
https://hackmd.io/ph9Q4nGHTb6T1mVJSV5yqw?view ref https://github.com/sourmash-bio/sourmash/issues/3233#issuecomment-2203433558, cc @musquita - # combining results from multiple different gathers Suppose you are using `fastgather`, and you want to combine the results from searching against multiple zip...
https://hackmd.io/tKpLr1ISR9mqHHmZbEvcow?view # containment search tutorial: mgmanysearch ## Install relevant sourmash stuff ``` mamba create -y -n contain_tut sourmash_plugin_branchwater mamba activate contain_tut pip install sourmash_plugin_containment_search ``` ## Get data Download 2.2...
this could be a good & substantive next step towards higher-level oxidation of sourmash internals - what if we added a `load_from` plugin entry point to branchwater that loaded zip...
updates: - [github.com/astral-sh/ruff-pre-commit: v0.4.10 → v0.5.0](https://github.com/astral-sh/ruff-pre-commit/compare/v0.4.10...v0.5.0)
Saw this in North Bay Python 2024, seems like a great complement (or replacement?) for `asv`
hackmd: [link](https://hackmd.io/EoYofvlFQ0SPF3wANO2w7Q?both) # gather benchmarking - sourmash v4.8.10 / branchwater plugin v0.9.5 Source repo: [sourmash-bio/2024-benchmark-gather](https://github.com/sourmash-bio/2024-benchmark-gather) ## sample SRR1976948 This sample contains 177 genomes. Benchmarking results with 64 threads (note pygather...
Hi, I have been struggling with finding an option to run on the sourmash gather command using multiple threads. Is this not available?
https://github.com/lskatz/dists2trees - make sure we are friendly with it
when run as part of [the release procedure](https://sourmash.readthedocs.io/en/latest/release.html)