scikit-learn-intelex icon indicating copy to clipboard operation
scikit-learn-intelex copied to clipboard

[PCA] svdDense in distributed mode does not work for unnormailized data

Open xwu99 opened this issue 3 years ago • 1 comments

oneAPI: 2021.3

Data: 0.0,1.0,0.0,7.0,0.0 2.0,0.0,3.0,4.0,5.0 4.0,0.0,0.0,6.0,7.0 8.0,0.0,1.0,5.0,7.0

For distributed version, input one line data from above dataset for a rank.

Running pca_batch with svdDense eigenvalues [[3.39562004 1.37148866 0.23289129]] eigenvectors [[-0.3902121 0.52491954 -0.32817531 0.47574208 -0.48802094] [-0.5068408 0.09939021 0.67516516 -0.40501089 -0.33667814] [-0.74898164 -0.46713755 -0.197722 0.16702423 0.3921963 ]]

Running pca_spmd with svdDense eigenvalues [[nan nan nan nan nan]] eigenvectors [[1. 0. 0. 0. 0.] [0. 1. 0. 0. 0.] [0. 0. 1. 0. 0.] [0. 0. 0. 1. 0.] [0. 0. 0. 0. 1.]]

Changed to defaultDense, everything works.

Running pca_batch with defaultDense eigenvalues [[3.39562004 1.37148866 0.23289129]] eigenvectors [[ 0.3902121 -0.52491954 0.32817531 -0.47574208 0.48802094] [ 0.5068408 -0.09939021 -0.67516516 0.40501089 0.33667814] [-0.74898164 -0.46713755 -0.197722 0.16702423 0.3921963 ]]

Running pca_spmd with defaultDense eigenvalues [[ 3.39562004e+00 1.37148866e+00 2.32891294e-01 1.02094725e-16 -2.96383754e-16]] eigenvectors [[ 0.3902121 -0.52491954 0.32817531 -0.47574208 0.48802094] [ 0.5068408 -0.09939021 -0.67516516 0.40501089 0.33667814] [-0.74898164 -0.46713755 -0.197722 0.16702423 0.3921963 ] [ 0.11966106 -0.05948796 0.62631849 0.75265846 0.15288167] [ 0.12471829 -0.70201479 -0.07130314 0.12346577 -0.68650758]]

xwu99 avatar Jul 30 '21 03:07 xwu99

Attach Reproducer

xwu99 avatar Jul 30 '21 03:07 xwu99