text-segmentation topic

List text-segmentation repositories

HarvestText

2.3k
Stars
330
Forks

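Text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods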

ekphrasis

660
Stars
92
Forks

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashta...

SymSpell

3.1k
Stars
281
Forks

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

symspellpy

772
Stars
116
Forks

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

awesome-ocr-resources

398
Stars
73
Forks

A collection of OCR (Optical Character Recognition) resources, including papers and datasets.

deepsegment

301
Stars
57
Forks

A sentence segmenter that actually works!

text-segmentation

242
Stars
57
Forks

Implementation of the paper: Text Segmentation as a Supervised Learning Task

WordSegmentationTM

76
Stars
13
Forks

Fast Word Segmentation with Triangular Matrix

awesome-topic-segmentation

106
Stars
13
Forks

(yet another not really) awesome topic/text segmentation list