Thibault Debatty

5 repositories owned by Thibault Debatty

java-string-similarity

Stars: 2.7k | Forks: 400

Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...

java-LSH

Stars: 289 | Forks: 82

A Java implementation of Locality-Sensitive Hashing (LSH)

java-graphs

Stars: 35 | Forks: 7

Algorithms that build k-nearest neighbors graphs (k-nn graphs): brute-force, NN-Descent, ...

php-language-processing

Stars: 26 | Forks: 6

A PHP library for language processing. Includes string distance functions (Levenshtein, Jaro-Winkler, ...), stemming, etc.

spark-knn-graphs

Stars: 41 | Forks: 15

Spark algorithms for building k-nn graphs