Soledad Galli comments

Results 354 comments of


                                            Soledad Galli

Create check_y_is_binary() check. Use in transformers in which the dependent variable must be binary.

Hi @Morgan-Sell Thanks for the PR. I am thinking out loud below: This function would be used by: - MeanEncoder - OrdinalEncoder - PRatioEncoder - WoE For categorical targets: -...

Create check_y_is_binary() check. Use in transformers in which the dependent variable must be binary.

More on this: we can use the type_of_target function from sklearn to check if the target is binary as we did here: https://github.com/feature-engine/feature_engine/blob/65fedf100bc85f364b26838866a264dbf7320841/feature_engine/discretisation/decision_tree.py#L166 We could add a parameter to this...

Create check_y_is_binary() check. Use in transformers in which the dependent variable must be binary.

Hi @Morgan-Sell It's me again :p I did some performance comparison, and `type_of_target` is slower than `y.unique()`. We have the `unique()` implementation is WoE, so we should stick with that...

add function to check that y is binary in dataframe_checks

> Actually, PRatio will be deprecated, so the only transformer atm using binary classifcation exclusively is WoE. Not worth the effort.

Polynomial Feaatures + SklearnWrapper weird behavior

Hi @datacubeR Thank you for raising the issue. Apologies for the delay. I was on holidays until yesterday. To check if I understand this correctly, the first pipeline should return...

Polynomial Feaatures + SklearnWrapper weird behavior

Yes. The PolynomialFeatures from sklearn is designed to just return the polynomial features. It operates over the entire dataset. As such, the result will have the original features, which are...

include code coverage

Sounds good @Morgan-Sell Shouldn't be too difficult (she said, :p) Thank you!

Ignore NaNs before using `OrdinalEncoder`

Hi @datacubeR Thanks for the suggestion. You are certainly not the first one who'd like the encoders to support NaN. I think @david-cortes made a similar suggestion here #481, am...

feature creation: create new features by combining variables with decision trees

Will pick up this on the week of Sept 7. At the moment I am without my computer. Thanks!

feature creation: create new features by combining variables with decision trees

I've just read in the article, that they add the features discretised as additional features to the dataset instead of replacing the existing ones. So we can expand the decision...