pega-datascientist-tools icon indicating copy to clipboard operation
pega-datascientist-tools copied to clipboard

Calculation of Feature Importance incorrect

Open operdeck opened this issue 5 months ago • 2 comments

pdstools version checks

  • [X] I have checked that this issue has not already been reported.

  • [X] I have confirmed this bug exists on the latest version of pdstools.

Issue description

The Feature Importance for NB models calculated by PDS tools isn't the same as in platform The R version has a subtle issue not using the right laplace smoothing (1 rather than 1/#bins) The Python version seems totally off, not calculating the diff from the mean and not scaling Platform suffers from same issues as python implementation, tracking this under BUG-880410

Reproducible example

See Excel sheet for analysis

Expected behavior

All versions should give the exact same results

Installed versions

n/a, issues have been around for a while

operdeck avatar Sep 20 '24 14:09 operdeck