Leland McInnes

487 comments by Leland McInnes

It looks like a broken link in the docs; what you want is something like:

```python
HDBSCAN(cluster_selection_method="leaf")
```

See this section of the docs: https://hdbscan.readthedocs.io/en/latest/parameter_selection.html#leaf-clustering

If you have the model and/or tree saved off then yes, you can extract leaf clusters directly, but it takes a bit of code. Probably the easiest way to...

Wheel building got broken at some point; I would appreciate it if you wanted to update the readme.

I was going to suggest that you need to set approx_min_span_tree to False, but I see you've already done that. The fact that even with that there are some differences...

Thanks, I'll look into this as soon as I get some time. This definitely seems like there may be a corner case bug somewhere. I definitely appreciate the data for...

Unfortunately no. I don't believe it is hard but I have simply not had the time to dig into the code and sort out exactly why it isn't working. Pull...

Some comments: The ``approx_min_span_tree=False`` needs to be set in the vector data version rather than the distance matrix version. That may remedy some of these issues. The other issue is that there...

Unfortunately yes. I had intended to write a new clustering library to address this and many other issues, but time has so far not permitted this. I remain hopeful that...

That is definitely a problem. I'll look into it as soon as I get a chance. In the meantime you can try 'arccos' as a metric, which may work better.
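For context on the suggestion above, here is a minimal sketch of what an arccos-style distance computes, assuming 'arccos' refers to the angle between two vectors (the arccosine of their cosine similarity). The function name `angular_distance` is hypothetical and this is not the library's implementation; whether the library rescales the angle is not stated in the comment.

```python
import math


def angular_distance(u, v):
    """Angle between u and v in radians (arccos of cosine similarity).

    Illustrative helper only; the name and behaviour are assumptions,
    not the library's own code.
    """
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("angular distance is undefined for zero vectors")
    # Clamp so floating-point round-off cannot push acos outside [-1, 1].
    cos_sim = max(-1.0, min(1.0, dot / (norm_u * norm_v)))
    return math.acos(cos_sim)


if __name__ == "__main__":
    print(angular_distance([1.0, 0.0], [0.0, 1.0]))  # orthogonal vectors
    print(angular_distance([1.0, 0.0], [2.0, 0.0]))  # same direction
```

Unlike `1 - cosine similarity`, the angle itself satisfies the triangle inequality, which is one plausible reason an arccos metric may behave better with distance-based clustering.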

It seems that the issue is that we are instantiating ball/kd-trees from sklearn, which does not support such distance metrics (yet). If you have a small enough dataset then adding...