kmcuda icon indicating copy to clipboard operation
kmcuda copied to clipboard

weird problem

Open lijiawenl opened this issue 7 years ago • 6 comments

The data i used is [183497*600] , i set the k = 4590 and i get the error : 'internal bug inside kmeans_init_centroids: dis_num is NaN ' '/src/kmeans.cu:814->'an illegal memory encounted' 'cudaMempy failed' . but i set the k equal other number, like k = 4591, the kmeans works. - -! i thought this might occur randomly I have no idea where the problem is.

lijiawenl avatar Aug 29 '18 04:08 lijiawenl

Check your input for NaNs. If there are too many - they creep into centroids.

vmarkovtsev avatar Aug 29 '18 06:08 vmarkovtsev

Otherwise I need your data to reproduce the problem.

vmarkovtsev avatar Aug 29 '18 06:08 vmarkovtsev

Thanks for your reply. you mean the data i used have NaNs, but it does not make sense that when i set k=4591, it works. Could you send me a email-address that i can send the data to you.

lijiawenl avatar Aug 29 '18 09:08 lijiawenl

Many thanks.

lijiawenl avatar Aug 29 '18 09:08 lijiawenl

There is a certain tolerance of NaNs during the centroids initialization but it is not bulletproof.

My email is written on my profile page.

vmarkovtsev avatar Aug 29 '18 09:08 vmarkovtsev

I seem to have a similar problem: the centroids randomly contain NaNs although my data doesn't. Repeating the clustering over and over again until no NaNs are in the centroids anymore kind of solves the issue. (350K, 15 dim, around k=100 clusters)

Is there any progress on this issue?

georgandreasjaksch avatar Nov 09 '18 07:11 georgandreasjaksch