category_encoders
category_encoders copied to clipboard
A library of sklearn compatible categorical variable encoders
Hello all. First, thanks for checking out and participating in this project. I started it a few years ago and with the help of many got it into a pretty...
I'm trying to encode some categorical variables and I found that I'm getting too much dimensions due to BinaryEncoder starting labeling from 1 instead of 0. I'm working on the...
Hi. I would like to propose an enhancement for OrdinalEncoder, although it may also be relevant for other encoders and could be implemented in a generic fashion. In particular, it...
## Expected Behavior There are a variety of applications in which zero-indexing would be preferred for the OrdinalEncoder. One example is preparing features for a PyTorch model with categorical embeddings,...
Fixed future warning "is_categorical is deprecated and will be removed in a future version. Use is_categorical_dtype instead". Fixes # ## Proposed Changes - elif pd.api.types.is_categorical(cols): + elif pd.api.types.is_categorical_dtype(cols):
Hi, this is not a bug report but rather a feature request (not sure if this is the place or how). It would be great to be able to specify...
Dear maintainers, would you please consider adding into TargetEncoder module ability to compute target means by category in out-of-fold fashion using custom folds generator? That way, at a fitting stage...
## Expected Behavior `tests/test_encoders.py` should be running for all encoders. ## Actual Behavior CountEncoder is missing from `__all__` in` category_encoders/__init__.py`. This has the effect that tests are skipped. 11 of...
Is frequency encoding covered with any of these encoders ?
## Expected Behavior OneHotEncoder category_mapping should return `self.mapping` (that is calculated in fit() and generated by generate_mapping() ) ## Actual Behavior OneHotEncoder category_mapping returns self.ordinal_encoder.category_mapping. I think ordinal_encoder is an...