text-processing topic
PyKoSpacing
Automatic Korean word spacing with Python
stringi
Fast and portable character string processing in R (with the Unicode ICU)
TextCluster
短文本聚类预处理模块 Short text cluster
textvec
Text vectorization tool to outperform TFIDF for classification tasks
text-detector
Tool which allow you to detect and translate text.
NLPre
Python library for Natural Language Preprocessing (NLPre)
konoha
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Unix-Text-Processing
Recreated sources for the book "UNIX Text Processing," published in 1987.
colibri-core
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dy...