sourmash
Create script to extract intersections between sets from gather output
PyUpSet (Python) and UpSetR (R) generate nice plots of the intersections between sets, but PyUpSet does not print the unique values/groups or the members of the intersections between these sets. @halexand and I can take this on. Just placing this here as a reminder.
@taylorreiter what are you using these days?
Something like this! It isn't very pretty, but it does the job. As @brooksph stated, upset plots are nice, but they do not print the unique values/groups or the intersections between these sets, so I brute-force it in a very not clever way at the bottom of the attached R script: plot_gather_output_var_imp.R.txt
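For anyone who wants the same thing in Python, here is a minimal sketch of the idea (this is not the attached R script, and it does not use any sourmash API). It assumes the gather matches have already been loaded into one Python set of match names per sample; the sample and genome names below are made up for illustration.

```python
from collections import defaultdict


def exclusive_intersections(named_sets):
    """Group items by the exact combination of sets that contain them.

    Returns a dict mapping frozenset(set names) -> set(items), i.e. the
    contents of each bar in an upset plot.
    """
    # Record, for every item, which sets it appears in.
    membership = defaultdict(set)
    for name, items in named_sets.items():
        for item in items:
            membership[item].add(name)

    # Invert that: group items by their combination of set names.
    groups = defaultdict(set)
    for item, names in membership.items():
        groups[frozenset(names)].add(item)
    return dict(groups)


# Hypothetical example data: matches found by gather in three samples.
gather_matches = {
    "sample_A": {"genome1", "genome2", "genome3"},
    "sample_B": {"genome2", "genome3", "genome4"},
    "sample_C": {"genome3", "genome5"},
}

groups = exclusive_intersections(gather_matches)

# Print the largest combinations first, then alphabetically.
for names, items in sorted(groups.items(),
                           key=lambda kv: (-len(kv[0]), sorted(kv[0]))):
    print(" & ".join(sorted(names)), "->", sorted(items))
```

Each item lands in exactly one group, so the groups are the exclusive intersections that an upset plot draws as bars, not the plain pairwise overlaps.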
(could you post an example image? :)
really nice, thanks taylor! here it is, inline --

ref #1234
aaaaactually this strikes me as an eminently doable thing for a new sig subcommand - basically something like sourmash sig overlap but for n signatures, not just two.
could output a file format that is trivial to load into 'upset' plotting code per #1234.
This is the format that goes into the R libraries (one column per set, one row per item, 1 = present, 0 = absent):
de novo    kaa-mer    reference
1          1          0
1          0          0
1          1          0
1          1          0
1          0          0
1          0          0
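A rough sketch of how a presence/absence matrix like this could be produced from n sets in Python. The function names and the example items are hypothetical, and this is not the output of any existing sourmash command. Tab is used as the separator here only because one of the column names ("de novo") contains a space.

```python
def membership_matrix(named_sets):
    """Build a 0/1 presence matrix: one row per item, one column per set."""
    names = list(named_sets)
    # Union of all items across all sets, sorted for a stable row order.
    all_items = sorted(set().union(*named_sets.values()))
    rows = [[int(item in named_sets[name]) for name in names]
            for item in all_items]
    return names, all_items, rows


def format_matrix(names, rows, sep="\t"):
    """Render the matrix as delimited text with a header line."""
    lines = [sep.join(names)]
    lines += [sep.join(str(value) for value in row) for row in rows]
    return "\n".join(lines)


# Hypothetical items; the column names follow the example in this thread.
example_sets = {
    "de novo": {"item1", "item2", "item3"},
    "kaa-mer": {"item1", "item3"},
    "reference": set(),
}

names, items, rows = membership_matrix(example_sets)
text = format_matrix(names, rows)
print(text)
```

Keeping the item list alongside the rows makes it easy to add an identifier column later if the plotting code needs one.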
adding the upset command via the betterplot plugin in https://github.com/sourmash-bio/sourmash_plugin_betterplot/pull/35 - it produces figures like this: