Christian Lorentzen
So currently, there is no critical voice, but there is still time to raise one. Please do so. Let's see how contributors would prefer to add narwhals:

- 🚀 add narwhals as...
I have a question about the API: wouldn't it be something completely new for users to specify a memory (RAM) limit? I would prefer to have a much simpler,...
The origin of the "friedman_mse" is the paper "Greedy function approximation: A gradient boosting machine." by Jerome H. Friedman [doi:1013203451](https://doi.org/10.1214/AOS/1013203451) around Eq 35. He even mentions that this is the...
Just read (or re-read) "Greedy Function Approxiations" again, in particular section 4.6 (and 4.5 and the reference "Additive logistic regression"). The point is that both cases use squared error, but...
> Ok but if both use squared error, then the splitting criterion should be "squared_error" and "friedman_mse" has no reason to exist (and anyway it computes exactly the same thing...
@cakedev0 I think there is no difference between squared error and Friedman squared error: `GradientBoostingClassifier` implements first-order gradient descent in function space according to [1]. There are no weights...
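To make the "no weights" point concrete, here is a minimal NumPy sketch (not part of the original comment; unit weights assumed) checking that the decrease in total squared error from a split equals Friedman's improvement term exactly, so both criteria rank splits identically:

```python
import numpy as np

rng = np.random.default_rng(0)
y = rng.normal(size=100)

# Candidate split: left/right children of a node containing all of y.
y_left, y_right = y[:30], y[30:]
n_l, n_r = len(y_left), len(y_right)

def sse(a):
    """Sum of squared deviations from the mean."""
    return np.sum((a - a.mean()) ** 2)

# Decrease in total squared error achieved by the split ("squared_error" view).
mse_gain = sse(y) - sse(y_left) - sse(y_right)

# Friedman's improvement criterion with unit weights ("friedman_mse" view).
friedman_gain = n_l * n_r / (n_l + n_r) * (y_left.mean() - y_right.mean()) ** 2

assert np.isclose(mse_gain, friedman_gain)  # identical up to rounding
```

With sample weights, replace the counts by weight sums and the means by weighted means; the identity still holds, which is why the two criteria only differ numerically, not in which split they prefer.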
> Although, note that HGB does line search for loss functions that aren't differentiable: see this [function/docstring](https://github.com/scikit-learn/scikit-learn/blob/c7d040e4f23e7888125de0af52e640329c8b9a5a/sklearn/ensemble/_hist_gradient_boosting/gradient_boosting.py#L74-L90) in the code.

Hint: look at the git blame. 😉 The author thought about future...
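As a rough illustration of what such a per-leaf line search does (a hypothetical sketch, not scikit-learn's actual implementation): for the absolute error loss, the Newton step is a poor leaf update, so after growing a tree on the gradients one replaces each leaf value by the loss-optimal constant, which for absolute error is the median of the residuals in that leaf.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def refit_leaves_absolute_error(tree, X, y, raw_predictions):
    """Hypothetical per-leaf line search for absolute error: replace each
    leaf value by the constant minimizing the loss in that leaf, i.e. the
    median of the residuals y - raw_predictions.  Writing into
    tree.tree_.value mutates the fitted tree in place."""
    leaves = tree.apply(X)
    for leaf in np.unique(leaves):
        mask = leaves == leaf
        tree.tree_.value[leaf, 0, 0] = np.median(y[mask] - raw_predictions[mask])

# Usage: one boosting step on the (sub)gradient of the absolute error.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = X[:, 0] + rng.standard_cauchy(size=200)  # heavy-tailed noise
raw = np.full_like(y, np.median(y))          # initial prediction
gradient = np.sign(y - raw)                  # negative gradient of |y - raw|
tree = DecisionTreeRegressor(max_depth=2).fit(X, gradient)
refit_leaves_absolute_error(tree, X, y, raw)
raw += 0.1 * tree.predict(X)                 # learning-rate-scaled update
```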
In general, it is good to merge those two.
To be honest, I did not like #32197 too much and I did not review it. Now that we have it, the right thing to do is to set `excluded_set`...
> You mean outside this screening function?

Yes, right after its initialization.