category_encoders issues

OneHotEncoder(sparse=True)

11

[sklearn.preprocessing.OneHotEncoder](https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.OneHotEncoder.html) has the option `sparse=True`, to return the output in a scipy.sparse matrix. This can be really useful if you have categories with high cardinality. Would it be possible to...

zachmayer

enhancement
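
For context on this request, here is a minimal sketch of the scikit-learn behaviour the issue refers to. It is not category_encoders code. The column below is made-up data, and the example relies on sparse output being scikit-learn's default rather than passing the keyword, since newer scikit-learn releases spell the option `sparse_output` instead of `sparse`.

```python
import numpy as np
from scipy import sparse
from sklearn.preprocessing import OneHotEncoder

# Hypothetical high-cardinality column: 1,000 rows, up to 500 distinct categories.
rng = np.random.default_rng(0)
X = rng.integers(0, 500, size=(1000, 1)).astype(str)

# Sparse output is the scikit-learn default, so no sparse keyword is passed here.
encoder = OneHotEncoder(handle_unknown="ignore")
encoded = encoder.fit_transform(X)

n_rows, n_cols = encoded.shape
# Only one cell per row is stored, instead of n_rows * n_cols dense cells.
print(sparse.issparse(encoded), encoded.nnz, n_rows * n_cols)
```

With one non-zero entry per row, the stored size grows with the number of rows and not with the number of categories, which is why the option matters for high-cardinality features.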

Parallel encoding of features in woe encoding

1

Are you planning to implement parallel encoding of features for WOE encoding?

Venki-Kavuri

enhancement

BinaryEncoder doesn't work together with cross_val_predict from sklearn

1

Versions: sklearn 0.22.1, category_encoders 2.1.0. Issue: if I use a fitted BinaryEncoder instance in a custom classifier, a ValueError is raised: "ValueError: Must train encoder before it can be...

grialx

bug

OrdinalEncoder fails when the input is entirely numeric or can be converted to numeric values

6

**Summary** `OrdinalEncoder.fit()` throws an exception when the input values are entirely numeric (i.e., `[1, 2, 3, 4, 5]`) or can be converted to numeric (i.e., `['001', '002', '003', '004',...

djrscally

Circular categories encoding

3

Hi! I came here looking for a way to encode categorical variables that have a circular distance relation (such as the days of the week, where the last day, Sunday,...

DelgadoPanadero

enhancement
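
One common way to express the circular relation this request describes is to place the categories on the unit circle and use the sine and cosine of the angle as two features. The sketch below only illustrates that idea. `encode_cyclic` and `distance` are hypothetical helper names, not part of category_encoders.

```python
import math

DAYS = ["monday", "tuesday", "wednesday", "thursday",
        "friday", "saturday", "sunday"]

def encode_cyclic(value, categories):
    """Map a category to (sin, cos) coordinates on the unit circle."""
    angle = 2 * math.pi * categories.index(value) / len(categories)
    return math.sin(angle), math.cos(angle)

def distance(a, b, categories=DAYS):
    """Euclidean distance between two encoded categories."""
    return math.dist(encode_cyclic(a, categories), encode_cyclic(b, categories))

# Sunday ends up as close to Monday as Monday is to Tuesday.
print(distance("sunday", "monday"), distance("monday", "tuesday"))
```

An ordinal encoding (monday=0, ..., sunday=6) would instead put Sunday and Monday at the two extremes, which is the problem the issue raises.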

Transformers for continuous data

3

Hi, I know that the library is focused on categorical encoding, but I think there is value in adding at least `StandardScaler` and `MinMaxScaler`, with a nice interface like the one we have...

mglowacki100

enhancement

Test fails

1

I am packaging this Python package for nixpkgs. When running the tests, I ran into:
```
error: [Errno 2] File b'source_data/mushrooms/agaricus-lepiota.csv' does not exist: b'source_data/mushrooms/agaricus-lepiota.csv'
```
I think that the path...

GuillaumeDesforges

bug

HashingEncoder doesn't transform the data

27

I'm trying to see the output of HashingEncoder. I used the original sample code from the documentation, but I don't see any differences between the transformed and non-transformed...

xKHUNx

[Feature request] Benchmark of encoding strategies for different tasks

1

I know that I'm asking for a lot here, but it'd be great to have some idea of which encoding strategies are useful in which cases: classification vs. regression...

Nathan-Furnal

Speed up HashingEncoder with util.hash_pandas_object

Extend HashingEncoder to work with `util.hash_pandas_object` as the hashing function. **Reasoning**: Currently, HashingEncoder relies on hashlib. Hashlib is nice, however: 1. hashlib works only value by value -> no vectorization...

janmotl

enhancement

help wanted
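
To make the vectorization argument concrete, the sketch below contrasts hashing a column value by value with hashlib against hashing it in one call with `pandas.util.hash_pandas_object`. It is not the HashingEncoder implementation: the helper names, the use of md5, and the bucket count of 8 are assumptions chosen for illustration.

```python
import hashlib

import pandas as pd

def hash_value_by_value(series, n_components=8):
    """Hash each value separately with hashlib (a Python-level loop)."""
    buckets = []
    for value in series:
        digest = hashlib.md5(str(value).encode("utf-8")).hexdigest()
        buckets.append(int(digest, 16) % n_components)
    return pd.Series(buckets, index=series.index)

def hash_vectorized(series, n_components=8):
    """Hash the whole column in one call with pandas' vectorized hasher."""
    hashed = pd.util.hash_pandas_object(series, index=False)  # uint64 Series
    return (hashed % n_components).astype("int64")

colors = pd.Series(["red", "green", "blue", "red"])
slow = hash_value_by_value(colors)
fast = hash_vectorized(colors)
print(slow.tolist(), fast.tolist())
```

The two functions use different hash functions, so the bucket assigned to a given value generally differs between them. What both preserve is that equal values land in the same bucket.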
