evidently icon indicating copy to clipboard operation
evidently copied to clipboard

probability classification report - does not show the entire size of data

Open Guidosalimbeni opened this issue 2 years ago • 3 comments

Hello, have been investigating your great library. However, after updating the tool and running a classification report I am noticing that the count of rows in the report is inconsistent with the data used for the calculation. I am not in the position of sharing the error but I hope is a quick thing to check on your side? I tried all the possible changes and debugging. I know for sure that the reference data has 2500 rows but the report only shows 480 records. Really not sure what else to check and any helps would be really appreciated.

Guidosalimbeni avatar May 29 '22 11:05 Guidosalimbeni

Hi @Guidosalimbeni , thank for sharing, we will try to figure it out.

I have quick questions:

  • Do I got it right, that this bug appeared in the latest version, and in the older one everything worked correctly? Or you built the dashboard in the latest version only? This will allow us to understand a little faster what the problem might come from.
  • For some reports we filter out rows with nan values, it might be the reason of the problem here. Could you please check the amount of rows with at least one nan value: df.isna().any(axis=1).sum() ? May it be, that there are 2020 rows with nan values?

emeli-dral avatar May 30 '22 17:05 emeli-dral

Hi @Guidosalimbeni,

In case it helps, these might be related: #241 and #242

Do the data include columns that have nans or np.inf values (even if they are not the target/prediction columns)? Does the count change if used with a dataset with only the target and/or prediction columns?

danieljmv01 avatar Jun 03 '22 13:06 danieljmv01

Thanks, @emeli-dral and @danieljmv01

  1. I have noticed it only in the new version but it might be that I did not notice the error in the previous version. Apologies I am not a great help here.
  2. I feel the point on Null values might likely be the issue. Let me test on Monday and I will let you know.

Guidosalimbeni avatar Jun 03 '22 14:06 Guidosalimbeni