text-processing topic

List text-processing repositories

PyKoSpacing

372
Stars
115
Forks
Watchers

Automatic Korean word spacing with Python

stringi

293
Stars
45
Forks
Watchers

Fast and portable character string processing in R (with the Unicode ICU)

TextCluster

262
Stars
62
Forks
Watchers

短文本聚类预处理模块 Short text cluster

textvec

193
Stars
25
Forks
Watchers

Text vectorization tool to outperform TFIDF for classification tasks

text-detector

178
Stars
40
Forks
Watchers

Tool which allow you to detect and translate text.

NLPre

186
Stars
34
Forks
Watchers

Python library for Natural Language Preprocessing (NLPre)

konoha

214
Stars
21
Forks
Watchers

🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.

Unix-Text-Processing

203
Stars
10
Forks
Watchers

Recreated sources for the book "UNIX Text Processing," published in 1987.

colibri-core

122
Stars
20
Forks
Watchers

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dy...