string_categorical_encoders icon indicating copy to clipboard operation
string_categorical_encoders copied to clipboard

missing values in categorical columns break minhash encoder

Open FedericoV opened this issue 5 years ago • 0 comments

If you have a column with missing values, then, the nan values will cause an error in the minhash encoder in this step:

string = spaces + " ".join(string.lower().split()) + spaces

I'm not sure what the best way to handle it is. I might replace all missing values with blank spaces before hand.

FedericoV avatar Jul 25 '19 20:07 FedericoV