simhash topic
stopwords
Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.
nlp
Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
python-hashes
Interesting (non-cryptographic) hashes implemented in pure Python.
simhash-java
A simple implementation of simhash algorithm by java.
gosimhash
A simhasher for Chinese documents implemented by golang, simply translated from yanyiwu/gosimhash
simhash-js
Simhash implementation in Javascript
semantic-sh
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
spirit_fingers
Elixir SimHash NIFs written in Rust
superminhash
SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex