sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

Results 369 sourmash issues

In #2321 and https://github.com/sourmash-bio/sourmash/pull/2322 we delve back into multigather... and I remembered how annoying the CSV output is, in that it is output to the signature `filename` for each query....

5.0
revisit_me

Luiz's talk about how we develop/design/evolve sourmash: https://www.youtube.com/watch?v=0jpnP8NtRfc&feature=youtu.be

talks
slides
videos

This is a summarization of remaining issues in #838, which has become unreadable :) ---- from @luizirber in https://github.com/dib-lab/sourmash/issues/838#issuecomment-573856948: I'll dump here some other pointers to `gather` improvements, and they...

speeding-up-gather

I'm on day 3 of a gather of the Hu S1 dataset against all genbank (500k+ genomes), and chatting with @taylorreiter about the find-the-unassigned script that @luizirber wrote, https://github.com/taylorreiter/cosmo-kmers/blob/master/scripts/unassigned.py it...

speeding-up-gather

while I'm thinking about it... it would be nice to move towards genuinely interactive search, gather, and MAGsearch. greyhound #1226 and greyhound.sourmash.bio/ is super cool, of course, but it's "only" searching...

speeding-up-gather

from [benchmarking on 1.2m signature genbank](https://github.com/dib-lab/2020-paper-sourmash-gather/issues/47), even fairly simple environmental metagenomes such as SRR1976948 are matching to lots of redundancy. we're looking at ways to deal with this at a...

speeding-up-gather

Luiz's talk on August 30th, 2022 at the annual JGI User Meeting: https://luizirber.org/talks/2022-08-30-JGI/slides.html

talks
slides

Bumps [serde_json](https://github.com/serde-rs/json) from 1.0.85 to 1.0.86. Release notes (sourced from serde_json's releases): v1.0.86: Support arbitrary_precision feature even in no-std mode (#928, thanks @kvinwang). Commits: 4217e8e Release 1.0.86; 99d6164 Reorganize number.rs...

rust
dependencies

Hi, according to the manual, at the moment the reverse complement is considered and, when is_protein=True, all 6 possible open reading frames are considered. Is there an option to keep...

I just ran gather on some genomes (attaching a snippet of results below). These are all single genomes.
```
overlap     p_query p_match avg_abund
---------   ------- ------- ---------
5.3 Mbp       44.2%...
```