C. Titus Brown
https://github.com/ctb/sourmash_plugin_abundhist is incredibly useful in several situations, but it needs some love. Ref https://github.com/ctb/sourmash_plugin_abundhist/issues/4 and https://github.com/sourmash-bio/sourmash/issues/3458, but also:

- better defaults and/or automated behavior
- smoothed histogram output
- other...
Workshop link: [K-mer Workshop for Biodiversity Genomics](https://github.com/WCSCourses/K-mer_Biodiversity_Genomics_2025/)

Tutorials by @bluegenes, supported by @bettafische:

- [1. An introduction to FracMinHash sketching for sequence comparisons](https://hackmd.io/@bluegenes/H1ItsC_fxx)
- [2. Comparing genomes and metagenomes using FracMinHash sketches](https://hackmd.io/@bluegenes/Byh_JfKGgg)...
NOTE: PR into #3647
See #1037: protein sketching doesn't warn you if you're sketching DNA sequence (but it should). Making an issue so it doesn't get lost in our backlog of pull requests!
One of the most dangerous flaws in our protein hash calculations is that sourmash doesn't do any sequence type checks: you need to explicitly specify `--input-is-protein` for amino acid inputs,...
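As a rough illustration of the kind of sequence type check discussed here, the sketch below guesses whether an input looks like DNA by measuring the fraction of characters drawn from a nucleotide alphabet. This is not sourmash code: the function name `looks_like_dna`, the `ACGTN` alphabet, and the 0.9 threshold are all assumptions made for this example, and a real check would likely need to handle IUPAC ambiguity codes and short inputs more carefully.

```python
# Hypothetical helper, not part of sourmash: a simple alphabet-based heuristic.
DNA_ALPHABET = set("ACGTN")  # assumed alphabet; ignores other IUPAC codes


def looks_like_dna(sequence, threshold=0.9):
    """Return True if at least `threshold` of the characters are A, C, G, T or N."""
    seq = sequence.upper()
    if not seq:
        # An empty sequence gives no evidence either way; treat it as not DNA.
        return False
    n_dna = sum(1 for ch in seq if ch in DNA_ALPHABET)
    return n_dna / len(seq) >= threshold


if __name__ == "__main__":
    for name, seq in [("dna", "ACGTACGTNN"), ("protein", "MKVLAAGIVGLLLAQ")]:
        if looks_like_dna(seq):
            print(f"{name}: looks like DNA; warn if it is being sketched as protein")
        else:
            print(f"{name}: does not look like DNA")
```

A heuristic like this can only support a warning, not a hard error, since a short peptide made up entirely of A, C, G, and T residues would be misclassified.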
I built the following RocksDB databases on farm (so, only locally accessible, for now). All under: `~ctbrown/scratch3/2025-build-farm-rocksdb/rocksdb`. I will move them somewhere better when I have a moment, and update...
Luiz shows us a nice command: https://github.com/AllTheBacteria/AllTheBacteria/issues/54#issuecomment-2585480967
so, for example `{ ... }` will get interpreted improperly.
I am not sure but I think we might be missing abund columns?