sklearn-pandas
Pandas integration with sklearn
Transformer's `get_feature_names` is being deprecated in favor of `get_feature_names_out`. It will be removed in sklearn 1.2 (see sklearn v1.0 [changelog](https://scikit-learn.org/stable/whats_new/v1.0.html#changelog), scikit-learn/scikit-learn#18444, and, for example, [OneHotEncoder.get_feature_names](https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.OneHotEncoder.html#sklearn.preprocessing.OneHotEncoder.get_feature_names)). This PR: - Prefers `estimator.get_feature_names_out()`...
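The "prefer the new method, fall back to the old one" idea from the PR description above might look roughly like the sketch below. This is not the PR's actual code: `resolve_feature_names`, `NewStyle`, and `OldStyle` are hypothetical names made up for illustration, and the stand-in classes only mimic the two method names so the snippet runs without sklearn installed.

```python
def resolve_feature_names(estimator, input_features=None):
    """Return output feature names, preferring the newer method name.

    Hypothetical helper for illustration; not part of sklearn-pandas.
    """
    # Newer method name introduced in sklearn 1.0.
    getter = getattr(estimator, "get_feature_names_out", None)
    if callable(getter):
        return list(getter(input_features))

    # Deprecated method name, kept as a fallback for older transformers.
    # Assumption: it is called without arguments here for simplicity.
    getter = getattr(estimator, "get_feature_names", None)
    if callable(getter):
        return list(getter())

    # The estimator exposes neither method.
    return None


class NewStyle:
    """Stand-in for a transformer that implements the newer method."""

    def get_feature_names_out(self, input_features=None):
        return ["x_a", "x_b"]


class OldStyle:
    """Stand-in for a transformer that only has the deprecated method."""

    def get_feature_names(self):
        return ["a", "b"]


if __name__ == "__main__":
    print(resolve_feature_names(NewStyle()))
    print(resolve_feature_names(OldStyle()))
    print(resolve_feature_names(object()))
```

Checking for the new name first means a transformer that still carries both methods never goes through the deprecated call, so no deprecation warning should be triggered for it.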
In the following lines the resulting prints do not change if the line `drop_cols=["salary"]` is commented out: ``` import sklearn.preprocessing import pandas as pd import sklearn_pandas data = pd.DataFrame( {...
I'm a little confused on what the license is supposed to be. With version 2.1.0, MIT is listed as the license in setup.py but the license file states Zlib. Is...
It would be valuable to have some mapping functionality which takes `y_true`, `y_pred`, `y_score`, `pred_decision` (output of `estimator.decision_function()`), `x` (`AUC` only), `y` (`AUC` only) and converts it into a standardized...
Hello, I wanted to know whether there is any way to do multiple transforms on multiple columns, treating each one separately. I was able to implement it using Sklearn's `ColumnTransformer`...
As mentioned in [Dynamic Columns section](https://github.com/scikit-learn-contrib/sklearn-pandas#dynamic-columns) of the documentation, DataFrameMapper supports selecting columns dynamically during the fit operation by passing a custom callable or using sklearn.compose.make_column_selector. I've tried doing so,...
Currently, the keyword argument **python_requires** of **setup()** is not set, and thus it is assumed that this distribution is compatible with all Python versions. However, I found it is not...
## Reference issue [See discussion with sklearn](https://github.com/scikit-learn/scikit-learn/issues/21858) devs. ## Description It seems that sklearn issues DeprecationWarnings at the calls to `get_feature_names()`. In particular, this happens around [this...
Some packages convert the dataframe type into other types that have a similar interface (modin/dask). This causes issues with sklearn-pandas, more specifically this function: https://github.com/scikit-learn-contrib/sklearn-pandas/blob/master/sklearn_pandas/dataframe_mapper.py#L311 I fixed this issue by...
I was wondering whether it is possible to preserve column names when using a transformer that requires multiple columns of the dataframe. I'll try to illustrate what I mean with...