
Binary Categorical Preprocessor works incorrectly

Open andreygetmanov opened this issue 2 years ago • 2 comments

  1. Categorical features from test_pipeline_for_side_task_predict are identified incorrectly (categorical_ids = [1, 2, 3, 4, 5, 7, 8]; true cat_ids = [1, 2, 3, 4, 5, 6, 8]).
  2. column_uniques contains 1.0 and 1 as distinct classes, so the binary-decoding condition does not work correctly (a column is recognized as binary if column_uniques <= 3: two classes and possible gaps).
  3. Feature types are identified incorrectly (feature_types = ["int", "str", "str", "str", "str", "str", "str", "str", "str"]; true feature_types = ["int", "int", "int", "int", "str", "str", "int", "str", "str"]).
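
To illustrate point 2, here is a minimal sketch of how numerically equal labels can be counted as separate classes. It assumes the column values reach the preprocessor as strings (so "1" and "1.0" differ), which the issue does not confirm. The helper `count_classes` is hypothetical and is not part of FEDOT.

```python
import numpy as np
import pandas as pd


def count_classes(column):
    """Count distinct classes, treating numerically equal labels as one class.

    Hypothetical helper for illustration only; not FEDOT code.
    """
    series = pd.Series(column)
    numeric = pd.to_numeric(series, errors="coerce")
    # If every non-missing value parses as a number, compare numerically,
    # so that "1" and "1.0" collapse into a single class.
    if numeric.notna().sum() == series.notna().sum():
        return numeric.dropna().nunique()
    # Otherwise fall back to comparing the string representations.
    return series.dropna().astype(str).nunique()


# Assumed input: a binary column whose labels arrive as mixed strings.
raw = np.array(["1", "1.0", "0", "0.0"], dtype=object)

naive = len(np.unique(raw.astype(str)))  # string comparison sees four labels
normalized = count_classes(raw)          # numeric comparison sees two classes
```

With the naive count, a truly binary column exceeds the `<= 3` threshold quoted above and would not be treated as binary; normalizing before counting avoids that in this toy case.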

Because the preprocessor behaves unstably, the results can be unpredictable. We need to find the causes of these problems, fix them, and cover them with unit tests.

andreygetmanov avatar Aug 10 '22 16:08 andreygetmanov

image

valer1435 avatar Aug 11 '22 15:08 valer1435

Give it a try, maybe this will solve the problem.

valer1435 avatar Aug 11 '22 15:08 valer1435