
[Research] Use grouping of infrequent categories in `OneHotEncoder`

Open eddiebergman opened this issue 3 years ago • 0 comments

> Feature: `preprocessing.OneHotEncoder` now supports grouping infrequent categories into a single feature. Grouping infrequent categories is enabled by specifying how to select infrequent categories with `min_frequency` or `max_categories`. https://github.com/scikit-learn/scikit-learn/pull/16018 by Thomas Fan.

Mentioned in #1611

This would need to be researched, but it could be very useful in low-memory settings and for categorical features with a large number of distinct categories.

eddiebergman avatar Nov 15 '22 15:11 eddiebergman