
Cluster tree nodes should be excluded when extracting exemplars while initializing or modifying PredictionData

Open Dicksonchin93 opened this issue 2 years ago • 0 comments

Cluster tree nodes (as opposed to nodes that correspond to raw data points) should be excluded when extracting exemplars while initializing or modifying PredictionData in https://github.com/scikit-learn-contrib/hdbscan/blob/94744a5715a639ecb084e803f96ddf6c909c3e07/hdbscan/flat.py#L789-L802 and https://github.com/scikit-learn-contrib/hdbscan/blob/94744a5715a639ecb084e803f96ddf6c909c3e07/hdbscan/prediction.py#L134-L143

When cluster tree nodes are among the child nodes selected as exemplars, this part of the code fails: https://github.com/scikit-learn-contrib/hdbscan/blob/94744a5715a639ecb084e803f96ddf6c909c3e07/hdbscan/flat.py#L802 and https://github.com/scikit-learn-contrib/hdbscan/blob/94744a5715a639ecb084e803f96ddf6c909c3e07/hdbscan/prediction.py#L143

Should the cluster nodes be removed before trying to extract from the raw data? Thoughts anyone?
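
To make the question concrete, here is a minimal sketch of one possible fix, not the library's actual code. It assumes the condensed tree is a NumPy record array with `parent`, `child`, `lambda_val` and `child_size` fields, and that raw data points have ids below `n_samples` while cluster nodes have ids at or above it. The helper `exemplar_point_ids` and the toy tree are hypothetical and only illustrate the idea.

```python
import numpy as np

# Toy stand-in for a condensed tree (field names are an assumption
# modelled on hdbscan's condensed tree record array).
tree_dtype = [
    ("parent", np.intp),
    ("child", np.intp),
    ("lambda_val", np.float64),
    ("child_size", np.intp),
]


def exemplar_point_ids(condensed_tree, leaf, n_samples):
    """Hypothetical helper: exemplar ids for `leaf`, raw points only."""
    rows = condensed_tree[condensed_tree["parent"] == leaf]
    # Drop children that are cluster nodes (ids >= n_samples) before
    # looking for the maximum lambda, so only raw points can be picked.
    rows = rows[rows["child"] < n_samples]
    if rows.size == 0:
        return np.array([], dtype=np.intp)
    max_lambda = rows["lambda_val"].max()
    return rows["child"][rows["lambda_val"] == max_lambda]


n_samples = 5
raw_data = np.arange(10, dtype=np.float64).reshape(n_samples, 2)

# Node 5 has two raw points (0, 1) and two cluster nodes (6, 7) as
# children; the cluster nodes carry the largest lambda value.
toy_tree = np.array(
    [
        (5, 0, 1.0, 1),
        (5, 1, 2.0, 1),
        (5, 6, 3.0, 2),
        (5, 7, 3.0, 2),
    ],
    dtype=tree_dtype,
)

# Without the filter, the max-lambda children are the cluster nodes 6
# and 7, which are not valid row indices into raw_data.
unfiltered = toy_tree["child"][toy_tree["lambda_val"] == toy_tree["lambda_val"].max()]

# With the filter, only raw point ids remain and indexing is safe.
ids = exemplar_point_ids(toy_tree, leaf=5, n_samples=n_samples)
exemplars = raw_data[ids]
```

In this toy case `raw_data[unfiltered]` would raise an `IndexError`, while the filtered ids index the raw data safely. Filtering on `child_size == 1` would be an alternative criterion under the same assumptions. Whether the maximum lambda should be taken before or after dropping the cluster nodes is an open design choice.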

Dicksonchin93, Jul 12 '22 14:07