Nick Becker
Nick Becker
Thanks for linking that issue @Wuuzzaa ! @lindeberg25 , we'd love to learn more about your use case and performance impact of using cuML's Random Forest vs. scikit-learn's RF. Let's...
@hectorpatino have you confirmed that the data going into the UMAP calls is the same? The notebooks are slightly different. I recommend filing a [cuML issue](https://github.com/rapidsai/cuml/issues/) that includes a minimal,...
This is incredibly exciting! More than any individual string operation, one of the most common pain points I see in workflows is the inability to bring strings along as a...
I cannot consistently reproduce this (though others have seen it as well). There may be something subtle happening with the Naive Bayes classifier.
Lost the logs from the failure in the automated nightly run, unfortunately. Could not reproduce this with 100 consecutive runs of Q28. Will be triggering a few long-running tests to...
Thanks for the bump John. Anecdotally, we find that the most effective setup includes setting the host memory limit as the maximum available system memory (`(free -m | awk '/^Mem:/{print...
> IMO, this would be too dangerous for a default. ... I agree with Peter here. What's most effective for a given workflow doesn't necessarily translate to what's most effective...
@RumiAllbert are you also only experiencing this using WSL2?
Can you provide details about your environment, library versions, and Python version?
I believe this is actually driven by `min_samples` (which is by default None and set to the value of `min_cluster_size`), as this is what determines the minimum number of neighbors...