LMW-tree

Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, clustering, random projections, random indexing, hashing, bit signatures

Results 26 LMW-tree issues

TF-IDF and BM25 weighting: probably quite useful with reflective random indexing, because it preserves the inner product space where BM25 works well. Also the log likelihood weighting from the TopSig paper.

new feature
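
For reference, a minimal sketch of a per-term BM25 weight that could be applied to term frequencies before indexing. The function names, the non-negative IDF variant, and the defaults k1 = 1.2 and b = 0.75 are assumptions chosen for illustration; none of this is taken from the LMW-tree code.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>

// Hypothetical helpers, not part of LMW-tree.

// Inverse document frequency. The "+ 1.0" inside the log keeps the value
// non-negative even for terms that occur in most documents.
inline double bm25Idf(std::size_t numDocs, std::size_t docFreq) {
    assert(docFreq <= numDocs);
    const double n = static_cast<double>(numDocs);
    const double df = static_cast<double>(docFreq);
    return std::log((n - df + 0.5) / (df + 0.5) + 1.0);
}

// BM25 weight of one term in one document.
// k1 controls term-frequency saturation, b controls length normalisation;
// the defaults are commonly used values, assumed here.
inline double bm25Weight(double termFreq, double docLen, double avgDocLen,
                         std::size_t numDocs, std::size_t docFreq,
                         double k1 = 1.2, double b = 0.75) {
    assert(avgDocLen > 0.0);
    const double norm = k1 * (1.0 - b + b * docLen / avgDocLen);
    return bm25Idf(numDocs, docFreq) * termFreq * (k1 + 1.0) / (termFreq + norm);
}
```

A term that is absent (termFreq of 0) gets weight 0, rarer terms get larger weights, and the same term frequency counts for less in a longer document.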

Some lower level unit tests for vector types and other concepts would also be useful.

testing

Switch to folly and jemalloc to replace std::string and std::vector. Might be faster. Introduce string and vector types in the lmwtree namespace. Use likely and unlikely where useful.

efficiency
refactoring
tools
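
A rough sketch of what the namespace aliases and branch hints could look like. The macro names LMW_LIKELY and LMW_UNLIKELY and the valueOr helper are hypothetical, and the aliases point at the standard library types here; the idea is that only the alias definitions would change if another implementation were adopted. The hints rely on the GCC/Clang builtin __builtin_expect and fall back to a plain condition elsewhere.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Branch prediction hints (hypothetical macro names).
#if defined(__GNUC__) || defined(__clang__)
#define LMW_LIKELY(x) (__builtin_expect(!!(x), 1))
#define LMW_UNLIKELY(x) (__builtin_expect(!!(x), 0))
#else
#define LMW_LIKELY(x) (!!(x))
#define LMW_UNLIKELY(x) (!!(x))
#endif

namespace lmwtree {

// Central aliases: code refers to lmwtree::string and lmwtree::vector,
// so the underlying types can be replaced in one place.
using string = std::string;

template <typename T>
using vector = std::vector<T>;

// Example use of a hint: an out-of-range index is treated as the rare case.
inline int valueOr(const vector<int>& v, std::size_t i, int fallback) {
    if (LMW_UNLIKELY(i >= v.size())) {
        return fallback;
    }
    return v[i];
}

}  // namespace lmwtree
```

The hints only affect code layout and never change results, so they are worth adding only on branches that profiling shows to be hot and heavily skewed.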

https://code.google.com/p/semanticvectors/wiki/ReflectiveRandomIndexing However, no binary vector version exists yet.

new feature

Use smart pointers in cases where the pointer overhead is not critical, such as tree nodes. Switch to move semantics for vectors.

refactoring
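
One possible shape for this, sketched with a made-up node type (SketchNode is hypothetical and not LMW-tree's actual node class). Children are owned through std::unique_ptr, so the whole subtree is freed automatically, and vectors are taken by value and moved into place so that callers can hand over a buffer without a copy. Requires C++14 for std::make_unique.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical tree node, for illustration only.
struct SketchNode {
    std::vector<float> centroid;
    std::vector<std::unique_ptr<SketchNode>> children;  // owning pointers

    // Takes the vector by value and moves it into the member, so a caller
    // passing std::move(v) transfers the buffer instead of copying it.
    explicit SketchNode(std::vector<float> c) : centroid(std::move(c)) {}

    // Creates a child owned by this node and returns a non-owning pointer.
    SketchNode* addChild(std::vector<float> c) {
        children.push_back(std::make_unique<SketchNode>(std::move(c)));
        return children.back().get();
    }
};

// Counts this node plus all descendants.
inline std::size_t countNodes(const SketchNode& node) {
    std::size_t total = 1;
    for (const auto& child : node.children) {
        assert(child != nullptr);
        total += countNodes(*child);
    }
    return total;
}
```

Raw pointers returned by addChild are observers only; ownership stays with the parent, which keeps destruction order simple.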

There is a TODO in the code for this.

bug
refactoring