
Adjust the train_test_split parts in pipelines

Open stavrostheocharis opened this issue 2 years ago • 5 comments

We use "train_test_split" in "src/analytics/models/pipelines" at lines 40 and 98. If only one class remains in the train set after splitting, the subsequent calculations (AUC ROC, etc.) will raise an error.

We need to adjust the "train_test_split" calls, possibly by using the "stratify" argument as described in the docs: https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html
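
For illustration, here is a minimal sketch of what a stratified split could look like. It does not reproduce the pipeline code; the toy arrays "X" and "y", the "test_size" and the "random_state" are assumptions made up for this example. Only "train_test_split" and its "stratify" argument come from scikit-learn.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical, heavily imbalanced toy data: 18 samples of class 0, 2 of class 1.
X = np.arange(40).reshape(20, 2)
y = np.array([0] * 18 + [1] * 2)

# stratify=y asks scikit-learn to preserve the class proportions of y
# in both the train and the test split.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.5, stratify=y, random_state=42
)

# Collect the distinct labels that ended up in each split.
train_classes = set(np.unique(y_train).tolist())
test_classes = set(np.unique(y_test).tolist())
print(train_classes, test_classes)
```

Note that stratification needs at least two samples of every class; with fewer, "train_test_split" raises a ValueError, so very small classes may still need separate handling.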

stavrostheocharis avatar Jan 05 '23 10:01 stavrostheocharis

I am willing to work on this as my first issue @stavrostheocharis @momegas

aditkay95 avatar Feb 16 '23 09:02 aditkay95

Feel free to open a PR. I've assigned this to you as well.

momegas avatar Feb 17 '23 09:02 momegas

Hey @aditkay95, please let me know when you start working on it so I can move it to in progress.

momegas avatar Feb 28 '23 11:02 momegas

I would like to work on this issue @momegas and @stavrostheocharis

Mikehade avatar Mar 14 '23 14:03 Mikehade

I'm assigning this to you.

momegas avatar Mar 15 '23 09:03 momegas