Chris de Vries

Results 26 issues of Chris de Vries

TF-IDF
BM25 - probably quite useful with Reflective Random Indexing because it preserves the inner product space, where BM25 works well
Log Likelihood from the TopSig paper
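As a rough illustration of the BM25 option, the sketch below computes a per-term BM25 weight that could be applied to a term's contribution before the vectors are combined. The function names `bm25Idf` and `bm25Weight` are hypothetical and not part of the existing code. The defaults `k1 = 1.2` and `b = 0.75` are common choices, and the IDF variant with `1 +` inside the logarithm is an assumption chosen to keep weights non-negative.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helpers for illustration only; not part of the project's API.

// Inverse document frequency. The "1.0 +" inside the log keeps the value
// non-negative even for terms that occur in most documents.
inline double bm25Idf(double numDocs, double docFreq) {
    assert(numDocs > 0.0 && docFreq >= 0.0 && docFreq <= numDocs);
    return std::log(1.0 + (numDocs - docFreq + 0.5) / (docFreq + 0.5));
}

// BM25 weight of one term in one document.
//   tf        : term frequency in the document
//   docLen    : document length in terms
//   avgDocLen : average document length in the collection
//   numDocs   : number of documents in the collection
//   docFreq   : number of documents containing the term
inline double bm25Weight(double tf, double docLen, double avgDocLen,
                         double numDocs, double docFreq,
                         double k1 = 1.2, double b = 0.75) {
    if (tf <= 0.0) {
        return 0.0;  // absent terms contribute nothing
    }
    // Length normalisation: longer-than-average documents are penalised.
    const double norm = k1 * (1.0 - b + b * docLen / avgDocLen);
    // Term-frequency saturation: the weight grows with tf but is bounded.
    return bm25Idf(numDocs, docFreq) * tf * (k1 + 1.0) / (tf + norm);
}
```

For a document of average length and `tf = 1`, the weight reduces to the IDF alone, which makes it easy to sanity-check against plain TF-IDF.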

new feature

Some lower level unit tests for vector types and other concepts would also be useful.

testing

Switch to folly and jemalloc to replace std::string and std::vector. This might be faster. Introduce string and vector types in the lmwtree namespace. Use the likely and unlikely branch hints where useful.
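One way to prepare for this is to route all code through aliases in the lmwtree namespace, so the underlying types can be changed in one place. The sketch below uses the standard types so that it compiles on its own; the aliases could later point at folly's fbstring and fbvector instead. The macro names `LMW_LIKELY` and `LMW_UNLIKELY` and the function `countNonEmpty` are hypothetical, and the hints rely on `__builtin_expect`, which is specific to GCC and Clang.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Branch prediction hints. Hypothetical macro names; they fall back to a
// plain expression on compilers without __builtin_expect.
#if defined(__GNUC__) || defined(__clang__)
#define LMW_LIKELY(x)   (__builtin_expect(!!(x), 1))
#define LMW_UNLIKELY(x) (__builtin_expect(!!(x), 0))
#else
#define LMW_LIKELY(x)   (x)
#define LMW_UNLIKELY(x) (x)
#endif

namespace lmwtree {

// Single point of change: swap these aliases to alternative
// implementations without touching the calling code.
using string = std::string;

template <typename T>
using vector = std::vector<T>;

// Example use of the aliases and hints: count identifiers that are not
// empty, assuming empty identifiers are rare.
inline std::size_t countNonEmpty(const vector<string>& ids) {
    std::size_t n = 0;
    for (const string& id : ids) {
        if (LMW_UNLIKELY(id.empty())) {
            continue;  // rare path
        }
        ++n;
    }
    return n;
}

}  // namespace lmwtree
```

Whether the hints help is workload dependent, so they are probably best added only where profiling shows a heavily skewed branch.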

efficiency
refactoring
tools

https://code.google.com/p/semanticvectors/wiki/ReflectiveRandomIndexing However, no binary vector version exists yet.

new feature

http://arma.sourceforge.net/

efficiency

Use smart pointers where the pointer overhead is not critical, such as for tree nodes. Switch to move semantics for vectors.
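A minimal sketch of what this could look like, assuming C++14 for `std::make_unique`. The `Node` type, its members, and the use of `std::uint64_t` blocks as the key are hypothetical stand-ins and do not reflect the actual tree node layout.

```cpp
#include <cassert>
#include <cstdint>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical tree node: owns its children through unique_ptr, so the
// whole subtree is released automatically when the node is destroyed.
struct Node {
    std::vector<std::uint64_t> key;               // e.g. packed bit vector
    std::vector<std::unique_ptr<Node>> children;  // owning pointers

    // Take the key by value and move it into place: callers passing an
    // rvalue pay for a move, not a copy of the underlying buffer.
    explicit Node(std::vector<std::uint64_t> k) : key(std::move(k)) {}

    // Create a child that takes ownership of the given key and return a
    // non-owning pointer to it for further use.
    Node* addChild(std::vector<std::uint64_t> k) {
        children.push_back(std::make_unique<Node>(std::move(k)));
        return children.back().get();
    }
};
```

Callers hand over a vector with `std::move(bits)` when they no longer need it. Hot inner loops can keep using raw, non-owning pointers obtained from `get()`, so the ownership bookkeeping stays out of the critical path.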

refactoring

There is a TODO in the code for this.

bug
refactoring