ngram topic

List ngram repositories

stringdistance

75
Stars
15
Forks
Watchers

A fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard si...

colibri-core

122
Stars
20
Forks
Watchers

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dy...

ngram2vec

835
Stars
174
Forks
Watchers

Four word embedding models implemented in Python. Supporting arbitrary context features

daguan_2019_rank9

131
Stars
43
Forks
Watchers

datagrand 2019 information extraction competition rank9

albert_pytorch

703
Stars
152
Forks
Watchers

A Lite Bert For Self-Supervised Learning Language Representations

refinr

102
Stars
5
Forks
Watchers

Cluster and merge similar string values: an R implementation of Open Refine clustering algorithms

ngram-type

182
Stars
31
Forks
Watchers

Touch typing trainer using N-grams as data source, with options to customize the auto-generated lessons and specify the minimum typing performance needed. There are sound/color effects as well.

ngram

70
Stars
23
Forks
Watchers

Fast n-Gram Tokenization

n-gram

74
Stars
20
Forks
Watchers

Get n-grams from text