scikit-learn-intelex icon indicating copy to clipboard operation
scikit-learn-intelex copied to clipboard

MAINT: aligning gpu interfaces y param with sklearn

Open ethanglaser opened this issue 1 year ago • 4 comments

Description

scikit-learn provides dummy y variable in cluster/compute (example) interfaces for general alignment - we mostly have this implemented in our daal4py/sklearnex cpu interfaces already. It's not used in sklearn or in sklearnex but provides uniform interfaces and will be useful in resolving common sklbench errors.

ethanglaser avatar Feb 21 '24 21:02 ethanglaser

/intelci: run

ethanglaser avatar Feb 21 '24 21:02 ethanglaser

/intelci: run

ethanglaser avatar Feb 22 '24 00:02 ethanglaser

@ethanglaser could you please reference sklearn_bench where this fix is required?

samir-nasibli avatar Feb 22 '24 10:02 samir-nasibli

@ethanglaser could you please reference sklearn_bench where this fix is required?

Is it required for spmd ifaces?

samir-nasibli avatar Feb 22 '24 10:02 samir-nasibli

@ethanglaser could you please reference sklearn_bench where this fix is required?

Is it required for spmd ifaces?

It mostly depends on config files setup, but in cases where same datasets are used for multiple algos (ie higgs for kmeans, linreg) it will pass dummy y data and breaks spmd interfaces that dont have dummy data in this case. Since this would align with our d4p/cpu and stock sklearn interfaces, and avoids this issue, it probably would be good do here.

ethanglaser avatar Feb 22 '24 17:02 ethanglaser

Any objections on this? Is it okay with API or is it considered breaking?

ethanglaser avatar Feb 26 '24 22:02 ethanglaser