LMW-tree
LMW-tree copied to clipboard
Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, clustering, random projections, random indexing, hashing, bit signatures
There is some memory overhead on vectors. std::string is around 32 bytes and we use this for a vector ID, but vectors are usually 512 bytes at most. So make...
Create a set of standard test datasets and their expected quality in terms of internal and external measures. This can be used to test for any regressions modifications may introduce....
The K-tree implementation was broken somewhere along the way. It needs to be fixed. Maybe it never worked completely in the first place.