cuml icon indicating copy to clipboard operation
cuml copied to clipboard

[FEA] Enable HDBSCAN to build knn graph using NN Descent

Open jinsolp opened this issue 1 year ago • 4 comments

Description

Adds build_algo option to HDBSCAN, and allow HDBSCAN to build knn graphs using nn descent Now user can choose the knn graph build algorithm between "brute_force_knn" and "nn_descent" Defaults to "auto", in which case decides to run with brute force knn or nn descent depending on the given dataset size. "auto" decides to run with brute_force_knn if data has <= 50K rows. Otherwise decides to run with nn_descent.

agg_nnd = HDBSCAN(min_samples=16, build_algo="nn_descent")
agg_nnd.fit(data)

Running Benchmarks

Screenshot 2024-08-22 at 3 16 55 PM

jinsolp avatar Jun 17 '24 16:06 jinsolp

algo.cuh and get_raft.cmake will be changed after this PR in cuML and this PR in raft is merged

jinsolp avatar Aug 22 '24 22:08 jinsolp

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

copy-pr-bot[bot] avatar Sep 21 '24 17:09 copy-pr-bot[bot]

/ok to test

cjnolet avatar Sep 21 '24 18:09 cjnolet

/ok to test

dantegd avatar Sep 23 '24 21:09 dantegd