Daniel Baker
Daniel Baker
Hi Joel - That's a fair point. Dependency management can be a mess and submodules has been a pretty reasonable way to handle them, but they come with some extra...
Thanks for pointing this out! This was an oversight on our part. This is added as of https://github.com/dnbaker/dashing2/pull/74. Best, Daniel
Hi - I suggest generating a sparse distance matrix when the number of entries is greater than about 50,000. You can make it sparse by choosing a minimum similarity (e.g.,...
Hi Xiaoming - Thank you for this issue! I'm looking into it. I really appreciate the feedback and I'll let you know when it's fixed up. Best wishes, Daniel
Checking back in on this - I seem to have fixed this issue on my machine and have updated the main branch accordingly. (See the linked PR.) You could build...
Hi Xiaoming, You found another bug. Thank you! There's a large feature-set (lots of ways to run computation), and we were erasing the sketched data because I was opening the...
Hi Xiaoming, Thanks again! I can reproduce it, but only with some builds, which is confusing to me. The version I've built of `dashing2` for my laptop is working, but...
Hi - The SetSketch is a great option. I think it's the best way to sketch and compare unweighted sets. In this library, we provide two implementations: 1. CSetSketch, which...
The CSetSketch is the un-truncated SetSketch. If you set b = 1, you'll recover it. It's simpler to compute since it doesn't need to compute as many thresholds. I found...
Hi, Thanks for letting me know! There were some recent changes to support M1, but in the process I broke Intel x86-64 support. I've patched this up in [this PR](https://github.com/dnbaker/sketch/pull/64)....