scikit-learn-intelex
scikit-learn-intelex copied to clipboard
Add StandardScaler Estimator
The StandardScaler estimator scales the data to zero mean and unit variance. Use the IncrementalBasicStatistics estimator to generate the mean and variance to scale the data. Investigate where the new implementation may be low performance and include guards in the code to use Scikit-learn as necessary. The final deliverable would be to add this estimator to the 'spmd' interfaces which are effective on MPI-enabled supercomputers, this will use the underlying MPI-enabled mean and variance calculators in IncrementalBasicStatistics. This is an easy difficulty project, and would be a medium time commitment when combined with other pre-processing projects.
https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html#sklearn.preprocessing.StandardScaler