category_encoders

Handle missing in one hot encoder

Open PaulWestenthanner opened this issue 1 year ago • 3 comments

Expected Behavior

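With handle_missing="value", the one-hot encoder should encode a missing value as 0 in every dummy column and add no extra column. The documentation describes this behavior ("'value' will encode a new value as 0 in every dummy column"), but the encoder currently adds a new column instead. We also need a test that covers this case.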

Actual Behavior

The encoder adds an extra column for the missing value instead of encoding it as all zeros.

Steps to Reproduce the Problem

from category_encoders import OneHotEncoder
import pandas as pd

he = OneHotEncoder(handle_missing="value")

data = [("foo", 1), ("bar", 2), (None, 6)]
data = pd.DataFrame(data, columns=["c1", "c2"])
print(he.fit_transform(data))
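
To make the two outcomes concrete without depending on the encoder itself, here is a minimal sketch using pandas.get_dummies as a stand-in. This is only an analogy for the documented and the reported output shapes, not how category_encoders implements them; the variable names all_zero and extra_column are made up for illustration.

```python
import pandas as pd

data = pd.DataFrame([("foo", 1), ("bar", 2), (None, 6)], columns=["c1", "c2"])

# Documented behavior (analogy): the missing value is 0 in every dummy column,
# so only the observed categories get a column.
all_zero = pd.get_dummies(data["c1"], dtype=int)

# Reported behavior (analogy): the missing value gets its own extra column.
extra_column = pd.get_dummies(data["c1"], dummy_na=True, dtype=int)

print(all_zero)
print(extra_column)
```

In the first frame the row for None is all zeros across two columns; in the second there are three columns and the row for None is flagged in the extra one.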

Specifications

  • Version: 2.6
  • Platform: linux

PaulWestenthanner, Mar 12 '23 12:03