Colton Baumler

Results 30 comments of Colton Baumler

I can rerun the script to create the lineage spreadsheet, but I don't see how a GenBank update could affect the GTDB and lineage file. We created those at the...

Thank you for walking through everything! I see my own misunderstanding clearly now. Would it be best to update the signature name with the species rank name from the lineage...

It's this kind of wisdom that keeps me showing up day after day.

I think if we add more info into the signature name and discuss it in the documentation, it would alleviate any confusion. It could also show in the output that...

Thanks for all the great explanations, ideas and details! One point I'm having trouble rectifying in my mind is the making signatures and taxonomies from GTDB data but naming the...

Truth, the representative genomic signature is the most important part of the database. While names are ephemeral, we are still using them. My understanding is we are using them in...

Please correct me if I misunderstood your comment about liking to see the output. I was referring to the fifth column of gather which returns the signature names of the...

For clarity, this workflow is: `sourmash sketch dna -p abund *.fq.gz` followed by `./filter-min-samples.py *.sig -o filtered.zip` or `./filter-min-samples.py *.sig -o ./output/` (from [this repo](https://github.com/ctb/2022-sourmash-filter-min-samples)). This will output a collection...

Wild idea. Thinking about Branchwater in relation to this technique. We have attempted to extract metagenome sequences from the SRA with random forest classified signatures. That was not as informative...

4a. branchwater search using the core genomic components of each common hash set and compare the results