Victor Lafargue comments

Results 35 comments of


                                            Victor Lafargue

Fix for non-contiguous strides

rerun tests

[BUG] silhouette_score / classlabels.py check_labels function throws CUDA_ERROR_INVALID_VALUE with large labels/X sizes.

@adityak74 this is indeed the issue. Could work on a fix for this : https://github.com/rapidsai/cuml/pull/5971.

Clustering does not accept input from CountVectorizer or TfidfVectorizer

If you want to scale ideally on multiple GPUs, I would recommend using the `HashingVectorizer` as a replacement to the `CountVectorizer`. It should yield good results while being stateless /...

Clustering does not accept input from CountVectorizer or TfidfVectorizer

There is an alternative solution which is simply to reduce the value for the number of features (`n_features` default=2**20). I recommend using cuML's `HashingVectorizer` (instead of DaskML) for GPU support,...

[BUG] UMAP random_state doesn't provide consistency

Are we planning to fix spectral initialization already or should I open a PR to update the documentation regarding this limitation for now? cc @cjnolet @dantegd