sensAI icon indicating copy to clipboard operation
sensAI copied to clipboard

FeatureGenerators: Support normalisation alongside categorical features that are not encoded

Open opcode81 opened this issue 11 months ago • 0 comments

Right now, we cannot deal with this when using categorical default rules, as they will render the categorical features unsupported rather than ignored.

While I am not aware of models that benefit from both unencoded categorical features and normalisation, we could still consider adding this, especially since users users are likely to apply unnecessary normalisation to, for example, tree-based models.

Proposal:

  • make add_categorical_default_rules accept not only bool but also an enum CategoricalRule with items UNSUPPORTED, IGNORED, NONE.
  • handle it appropriately
  • pass add_categorical_rules through to further subclasses, in particular TakeColumns.

@schroedk FYI

opcode81 avatar Feb 27 '24 18:02 opcode81