feature_engine icon indicating copy to clipboard operation
feature_engine copied to clipboard

Feature engineering package with sklearn like functionality

Results 138 feature_engine issues
Sort by recently updated
recently updated
newest added

**Is your feature request related to a problem? Please describe.** Filtering Noisy features from a set of features can be easily accomplished by adding a one or more random variables...

new transformer

**Is your feature request related to a problem? Please describe.** From [Kaggle](https://www.kaggle.com/ryanholbrook/clustering-with-k-means) > The motivating idea for adding cluster labels is that the clusters will break up complicated relationships across...

new transformer

At the moment, the DecisionTreeDiscretiser returns the values of the tree predictions as the replacement of the original variables. I would like to add the option to return integers from...

enhancement

This code needs unit tests, but I wanted to get feedback on this newer implementation of the class. closes stale PR #359

*Is your feature request related to a problem?* `OrdinalEncoder` should accept nulls. Sometimes you don't want to impute directly but using Imputing Options of XGBoost, LightGBM or CatBoost. Because of...

**Is your feature request related to a problem? Please describe.** Hi Sole, currently Im doing the mini course of [feature engineering](https://www.kaggle.com/ryanholbrook/target-encoding) on kaggle. The last seccion is about Mean Encoding...

urgent
priority

**Describe the bug** A clear and concise description of what the bug is. When using PolynomialFeaturs + SklearnWrappers the base features are duplicated, when trying to dedup using DropDuplicateFeatures the...

bug
good first issue
urgent
easy

Closes #394. The transformer accepts a dictionary that defines how numeric variables will be discretized/organized into bins. The transformer calculates and returns the average for the respective bins.

In multivariate imputation, we estimate the values of missing data using regression or classification models based of the other variables in the data. The iterativeimputer will allows us only to...

Closes #413 Notes from #413: Many transformers in feature engine require that y is binary. At the moment we do this check within each transformer. We should create a function...