sentence-tokenizer topic

List sentence-tokenizer repositories
trafficstars

sentences

424
Stars
38
Forks
Watchers

A multilingual command line sentence tokenizer in Golang

vnlp

239
Stars
17
Forks
Watchers

State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.

pySBD

738
Stars
78
Forks
Watchers

🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.

bunkai

189
Stars
11
Forks
Watchers

Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)

zemberek-nlp-server

74
Stars
17
Forks
Watchers

Zemberek Türkçe NLP Java Kütüphanesi üzerine REST Docker Sunucu

punkt-segmenter

92
Stars
10
Forks
Watchers

Ruby port of the NLTK Punkt sentence segmentation algorithm

sentence-autosegmentation

37
Stars
11
Forks
Watchers

Deep-learning based sentence auto-segmentation from unstructured text w/o punctuation

sengiri

21
Stars
5
Forks
Watchers

Yet another sentence-level tokenizer for the Japanese text

sentences

37
Stars
0
Forks
Watchers

A command-line utility that splits natural language text into sentences.