Leland McInnes
Leland McInnes
Sorry, I have been very busy with a number of other projects, and this was relatively low on the priority list (I was hoping to significantly overhaul the soft clustering...
I believe this did get fixed actually, but due to some other patches elsewhere that intersected with this. What version of hdbscan are you running?
Hmm, let me take a look again.
I have a proposed fix -- let me know if the current master resolves the issue for you.
Thanks for the example. Unfortunately it looks like I'm not going to have time to dig into this until Tuesday. Hopefully it can wait until then, at which point I'll...
Digging in to this I think the answer (unfortunately?) is that this is "just how it works". The soft clustering considers the distance from exemplars, and the merge height in...
I understand. I have plans for a different clustering algorithm that is more amenable to producing soft clustering via something along these lines, but likely rather more robustly. Sorry I...
Unfortunately my time has been rapidly soaked up by other projects (largely UMAP), so I haven't had the opportunity to sit down and code up the new algorithm as I...
Not as yet, sorry.
The sum will be the probability that the point is in any cluster. Since HDBSCAN considers some points "noise" you can think of this as one minus the probability that...