text-processing topic
regex-automata
A low-level regular expression library that uses deterministic finite automata.
fastNLP
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
pyparsing
Python library for creating PEG parsers
ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashta...
python-nameparser
A simple Python module for parsing human names into their individual components
whatlanggo
Natural language detection library for Go
pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks su...
bsed
Simple SQL-like syntax on top of Perl text processing.
textpipe
Textpipe: clean and extract metadata from text