Leland McInnes comments

Results 492 comments of


                                            Leland McInnes

Crash when running with larger dataset

That sounds troubling, but I can't say too much without a little more information. Presumably the whole thing is segfaulting somewhere inside numba's workload. Are you using any different metrics,...

Crash when running with larger dataset

Any chance you could try installing pynndescent and see if that makes any difference?

Crash when running with larger dataset

I'm afraid this may have to suffice as a workaround for now -- I'll try to figure out what the issue might be, but it will likely be hard to...

Crash when running with larger dataset

I'm glad it is working. The crash is very puzzling. I am seeing some crash issues with a new metric I am implementing in pynndescent (it won't be the cause...

inverse_transform doesn't work on 1D data

Not at present unfortunately. It is possible, but it would need a separate code path to do so, which doesn't exist at this time. Sorry.

Systematically determine `min_dist` and `n_neighbors`

If you have specific labels to measure against you could cluster the embedding (with, say, hdbscan) and look at the adjusted Rand score (``adjusted_rand_score`` in ``sklearn.metrics``). of the clustering against...

Systematically determine `min_dist` and `n_neighbors`

If there are no labels then there is not "truth", so no, there isn't a general method that works without labels.

Systematically determine `min_dist` and `n_neighbors`

The ``n_neighbors`` parameter in UMAP and the ``n_samples`` parameter in DBSCAN/HDBSCAN mean different things. They are, at least, measured in the same units. There is some reason to believe one...

Scikit-learn 1.3.0 release induce a bug

Yes, it looks like they've refactored how all of that works quite a bit. There's no easy way to handle that anymore looking at the current setup. I think the...

Scikit-learn 1.3.0 release induce a bug

This should be fixed in master now. I'll see if I can get a release out soon.