simdec-python icon indicating copy to clipboard operation
simdec-python copied to clipboard

Numeric boundaries for states

Open gnopik opened this issue 1 year ago • 3 comments

Numeric boundaries are formed wrong for the states of input variables that have a limited number of unique values, which results in some missing and some 'overcrowded' scenarios: image

Here is how the issue was solved in matlab: image

gnopik avatar May 29 '24 09:05 gnopik

Some are already ok after the categorical fix, but some are still wrong: hype.csv - first two variables are outputs, no NaNs hype.csv

How the dashboard shows it

image

How it should be

image

gnopik avatar Sep 06 '24 07:09 gnopik

Another example where the dashboard defines states for the categorical variable wrong, misclassifying portions of data resulting in NaN scenarios (but no NaNs in the data). [converting [1 2 3 4 5] to [1000 2000 3000 4000 5000] did not help] image data10.csv

gnopik avatar Sep 18 '24 11:09 gnopik

data.csv Another example, where the most influential categorical variable with five unique values gets misclassified into 4 states, the 5th remaining empty.

gnopik avatar Nov 22 '24 08:11 gnopik