hdbscan icon indicating copy to clipboard operation
hdbscan copied to clipboard

new clusters from noise

Open vamsibanda opened this issue 3 years ago • 1 comments

I have trained a model initially on a custom dataset and used the approximate_predict operation to compute a predicted cluster for the new data points every day. Over time, I have collected all the noise examples and found out there is a possibility for new clusters. If I fit the model again it will change the existing clusters. Is there a way I can find new clusters from noise and add them as an extension to the existing model?

vamsibanda avatar Jan 30 '22 19:01 vamsibanda

I don't think it is possible to add clusters to an existing model, or to train a new model to keep some existing clusters the same (#70). Perhaps the easiest thing would be to keep the first model, train a second model on the noise, and combine the results.

adilosa avatar Feb 01 '22 19:02 adilosa