feature_engine icon indicating copy to clipboard operation
feature_engine copied to clipboard

Feature engineering package with sklearn like functionality

Results 138 feature_engine issues
Sort by recently updated
recently updated
newest added

Added MAD outlier rule Refactored base code and tests

**Is your feature request related to a problem? Please describe.** We can add outlier rule similar to gaussian (mean +- k * std) but using robust statistics (median +- k...

This is my shot for #428. I noticed `OrdinalEncoder` inherits the `transform` method from `CategoricalMethodsMixin`. So I added an additional condition to output -1 only for `OrdinalEncoder`. Additionally I had...

Hi @solegalli, This is my shot for fixing #489. After checking the code in detail I think the issue affects not only `PolynomialFeatures` but also Sklearn `OneHotEncoder`. When using `SklearnTransformerWrapper`...

The OrdinalEncoder has an errors argument which can either raise an error or output NaNs when encountering new categories. For this particular class, it'd make sense to output -1 when...

good first issue
enhancement

Presently, there are no packages in python to calculate Information Value using WOE for nominal/categorical variables. As WOE Encoder is already available in Feature Engine, hence I am raising the...

new transformer

Related to [`RareLabelEncoder`](https://feature-engine.readthedocs.io/en/latest/api_doc/encoding/RareLabelEncoder.html), I wrote an `UnstableLabelEncoder` that groups categories that are unstable over time. You define `n_time_buckets` (for example `5`) and a `time_variable`. Then I cut the `time_variable` into...

**Is your feature request related to a problem? Please describe.** Staple thresholds 0.1 and 0.25 are empirical, but there are alternatives to calculate threshold based on data and parameters of...

As per description here: https://medium.com/towards-data-science/mrmr-explained-exactly-how-you-wished-someone-explained-to-you-9cf4ed27458b and references therein.