clustering
clustering copied to clipboard
make kmeans distance fun configurable, add cosine sim
I'm a bit reluctant to incorporate this change. K-means is not compatible with arbitrary distance functions. See discussion here: https://stats.stackexchange.com/questions/81481/why-does-k-means-clustering-algorithm-use-only-euclidean-distance-metric.