text-processing topic

List text-processing repositories

regex-automata

353 stars, 26 forks

A low level regular expression library that uses deterministic finite automata.

fastNLP

3.0k stars, 451 forks

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

pyparsing

2.1k stars, 275 forks

Python library for creating PEG parsers

hck

684 stars, 18 forks

A sharp cut(1) clone.

ekphrasis

660 stars, 92 forks

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashta...

python-nameparser

638 stars, 104 forks

A simple Python module for parsing human names into their individual components

whatlanggo

630 stars, 63 forks

Natural language detection library for Go

pynlpl

477 stars, 67 forks

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks su...

bsed

408 stars, 15 forks

Simple SQL-like syntax on top of Perl text processing.

textpipe

299 stars, 27 forks

Textpipe: clean and extract metadata from text