Leland McInnes
Leland McInnes
The ability to compute jaccard using sparse structure is dependent on *the number of non-zeros* per row, not on the actual dimension. So 10^6 dimensions might be fine if most...
It should work with numba 0.55; 0.51.2 is a *minimum* numba version. Perhaps this is to do with poetry?
Really UMAP is primarily good at dealing with continuous variables, or at the very least numeric vectors for which a sensible distance metric exists or can be defined. The problem...
Sadly I`m not sure that there is much I can do about this as a certain amount is down to the operating system. I agree that it is a potentially...
I don't believe this is anything I can fix at all easily -- it comes down to lower level libraries like numpy which I rely on. Sorry. On Mon, Jun...
I have some code for it [here](https://github.com/scikit-learn-contrib/hdbscan/blob/master/hdbscan/validity.py). It has some dependency on hdbscan, but in practice that amounts to the mst_linkage_core, which you can replace with any suitable minimum spanning...
I have code for dendrogram simplification and plotting the resulting pruned/condensed dendrogram as part of my clustering project (http://github.com/scikit-learn-contrib/hdbscan). The `condense_tree` tree routine in `hdbscan/_hdbscan_tree.pyx` handles tree simplification and there...
Is this something you have time to try? My initial attempts ran into the difficulty that I was unsure how to get a string in python giving a uri of...
This would certainly be a good thing. I believe that this is potentially possible, but it will definitely be slower than a bulk/batch build. I'm not sure when I or...
I believe you will have to call ``prepare()`` before update will work. That probably is not well documented. Sorry. On Thu, Feb 17, 2022 at 7:30 PM Hobson Lane ***@***.***>...