mljar-supervised icon indicating copy to clipboard operation
mljar-supervised copied to clipboard

Incorrect AUC value in CatBoost chart with sample_weight

Open offchan42 opened this issue 2 years ago • 4 comments

I trained a dataset with sample weight using 3 algos: LightGBM, Xgboost, and CatBoost. I found that the learning curve chart for CatBoost doesn't take into account the sample weight but the score in the table does. Maybe you forgot to put sample_weight for CatBoost charts? I also see the problems in the ROC curve chart (but it's the same behavior among all models). Also, could this affect the training result e.g. terminating at the wrong place? Because I saw the model trained for many iterations.

image image

offchan42 avatar May 09 '22 00:05 offchan42

@off99555 thank you for reporting. Is it the problem only for CatBoost?

pplonski avatar May 09 '22 06:05 pplonski

@pplonski Yes, I think so. At least LightGBM and Xgboost don't seem to have this problem (wrong metric in the learning curve chart).

offchan42 avatar May 09 '22 11:05 offchan42

kindly to ask is there somebody working on this issue? If not, I'm glad to undertake it @pplonski

alencn1024 avatar Aug 20 '22 05:08 alencn1024

@alencn1024 thanks for looking into it!

pplonski avatar Aug 20 '22 05:08 pplonski