string_categorical_encoders
missing values in categorical columns break minhash encoder
If a column has missing values, the NaN values cause an error in the minhash encoder at this step:
string = spaces + " ".join(string.lower().split()) + spaces
I'm not sure what the best way to handle this is. I might replace all missing values with blank spaces beforehand.
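Here is a minimal sketch of the failure and the workaround, not the encoder's actual code. It assumes the error comes from NaN being a float, which has no `.lower()` method. The `normalize` and `is_missing` helpers and the `n_spaces` parameter are hypothetical names I made up to wrap the line quoted above.

```python
import math

def normalize(string, n_spaces=1):
    # Hypothetical wrapper around the line quoted above.
    spaces = " " * n_spaces
    return spaces + " ".join(string.lower().split()) + spaces

def is_missing(value):
    # Treat None and float NaN as missing (an assumption about the input data).
    return value is None or (isinstance(value, float) and math.isnan(value))

column = ["New York", float("nan"), "  los   Angeles "]

# Reproduce the error: NaN is a float, so .lower() does not exist on it.
error = None
try:
    [normalize(value) for value in column]
except AttributeError as exc:
    error = str(exc)

# Workaround: replace missing values with a blank space before normalizing.
cleaned = [normalize(" " if is_missing(value) else value) for value in column]
```

With a pandas column, `Series.fillna(" ")` before encoding would do the same replacement. One caveat is that every missing value then maps to the same blank string, so the encoder treats them all as one category.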