Daniel Baker
Daniel Baker
Hi Jianshu, Great to hear - glad it's fast. You're right - the default output is similarity, so you'll assign points to clusters where their similarity is over that threshold....
Hey Jianshu - A signal 9 with dashing2 usually means that it ran out of memory, and it was killed by a process that terminates jobs that hit that kind...
Great news - thanks for letting me know! In the newest version, I've introduced some options you can use to lower peak memory, but `--fastcmp-shorts` should cut the memory requirements...
Hi Jianshu, We're still writing the preprint currently. I expect we'll submit it within the next two weeks. Thanks for checking in, Daniel
Hi Jianshu, Sorry for taking so long - I defended and moved over the new year/holiday. It's possible that incompleteness can affect how similar they seem to be. You could...
Hi Dasa, Thanks for the report - it's very helpful. I've confirmed that the `--asymmetric-all-pairs` with `-F paths.txt` is computing only one of the two containment scores, so I'm looking...
Hi again, I found the problem - Dashing2 was selecting symmetric containment instead of standard containment. This was responsible for the distance matrices being symmetric when they shouldn't have been....
Hi dejsha, Thanks for your bug report! You're right - Dashing2 is ignoring the distance metric for the --set option. This is also the case for weighted Jaccard (`--countdict`). I'll...
Hi, I've found and corrected the bug. I've merged it into `main`, and I'll let you know when the new binary releases are available. Thanks, Daniel
New binaries are out - You can fetch them from https://github.com/dnbaker/dashing2-binaries or from the tarball release https://github.com/dnbaker/dashing2-binaries/archive/refs/tags/v2.1.10.tar.gz. Thanks again, Daniel