feature_engine
Feature engineering package with sklearn-like functionality
Added MAD outlier rule. Refactored base code and tests.
**Is your feature request related to a problem? Please describe.** We could add an outlier rule similar to the Gaussian one (mean ± k * std) but using robust statistics (median ± k...
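To make the robust rule concrete, here is a minimal plain-Python sketch of bounds built from the median and the median absolute deviation (MAD). The function name `mad_bounds`, the default `k=3.0`, and the use of an unscaled MAD are assumptions for illustration only; this is not the feature_engine implementation.

```python
from statistics import median


def mad_bounds(values, k=3.0):
    """Return (lower, upper) bounds as median -/+ k * MAD.

    Hypothetical helper: MAD here is the unscaled median of the
    absolute deviations from the median.
    """
    med = median(values)
    mad = median(abs(v - med) for v in values)
    return med - k * mad, med + k * mad


values = [10, 11, 12, 13, 14, 100]
lower, upper = mad_bounds(values, k=3.0)

# Values outside the bounds are flagged as outliers.
outliers = [v for v in values if v < lower or v > upper]
```

Because the median and MAD are barely moved by the extreme value, the bounds stay tight around the bulk of the data, whereas a mean and standard deviation computed on the same list would be pulled toward 100.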
This is my shot for #428. I noticed `OrdinalEncoder` inherits the `transform` method from `CategoricalMethodsMixin`. So I added an additional condition to output -1 only for `OrdinalEncoder`. Additionally I had...
Hi @solegalli, This is my shot for fixing #489. After checking the code in detail I think the issue affects not only `PolynomialFeatures` but also Sklearn `OneHotEncoder`. When using `SklearnTransformerWrapper`...
The OrdinalEncoder has an errors argument which can either raise an error or output NaNs when encountering new categories. For this particular class, it'd make sense to output -1 when...
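The behaviour being requested can be sketched in plain Python as follows. The helpers `fit_ordinal` and `transform_ordinal` and the `unseen` parameter are hypothetical names for illustration; they are not part of the `OrdinalEncoder` API.

```python
def fit_ordinal(categories):
    """Map each category to an integer in order of first appearance."""
    mapping = {}
    for cat in categories:
        if cat not in mapping:
            mapping[cat] = len(mapping)
    return mapping


def transform_ordinal(categories, mapping, unseen=-1):
    """Encode categories; any category absent from the mapping gets `unseen`."""
    return [mapping.get(cat, unseen) for cat in categories]


mapping = fit_ordinal(["red", "green", "red", "blue"])

# "purple" was not seen during fitting, so it is encoded as -1.
encoded = transform_ordinal(["green", "purple", "red"], mapping)
```

Using -1 keeps the output column numeric and free of missing values, and it cannot collide with a learned code because those start at 0.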
Presently, there are no Python packages that calculate Information Value using WOE for nominal/categorical variables. As the WOE encoder is already available in Feature Engine, I am raising the...
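For reference, a minimal sketch of how Information Value can be derived from WOE for a single categorical variable and a binary target. The function name `woe_and_iv` is hypothetical, and the sign convention (share of positives over share of negatives) is an assumption; the sketch does not use or reproduce the Feature Engine encoder.

```python
import math
from collections import Counter


def woe_and_iv(categories, target):
    """Return (WOE per category, Information Value) for a binary target.

    WOE(cat) = ln(share of positives in cat / share of negatives in cat)
    IV       = sum over categories of (pos share - neg share) * WOE(cat)
    """
    pos = Counter(c for c, y in zip(categories, target) if y == 1)
    neg = Counter(c for c, y in zip(categories, target) if y == 0)
    total_pos, total_neg = sum(pos.values()), sum(neg.values())

    woe, iv = {}, 0.0
    for cat in set(categories):
        p = pos[cat] / total_pos
        n = neg[cat] / total_neg
        if p == 0 or n == 0:
            # The logarithm is undefined when a category lacks one class.
            raise ValueError(f"category {cat!r} has no positives or no negatives")
        woe[cat] = math.log(p / n)
        iv += (p - n) * woe[cat]
    return woe, iv


categories = ["a"] * 4 + ["b"] * 4
target = [1, 1, 1, 0, 1, 0, 0, 0]
woe, iv = woe_and_iv(categories, target)
```

Each category's contribution to IV is non-negative, since the difference in shares and the WOE always have the same sign.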
PR for #494
Related to [`RareLabelEncoder`](https://feature-engine.readthedocs.io/en/latest/api_doc/encoding/RareLabelEncoder.html), I wrote an `UnstableLabelEncoder` that groups categories that are unstable over time. You define `n_time_buckets` (for example `5`) and a `time_variable`. Then I cut the `time_variable` into...
**Is your feature request related to a problem? Please describe.** Staple thresholds 0.1 and 0.25 are empirical, but there are alternatives to calculate threshold based on data and parameters of...
As per description here: https://medium.com/towards-data-science/mrmr-explained-exactly-how-you-wished-someone-explained-to-you-9cf4ed27458b and references therein.