sentence-tokenizer topic
sentences
A multilingual command line sentence tokenizer in Golang
vnlp
State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.
pySBD
🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
bunkai
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
zemberek-nlp-server
Zemberek Türkçe NLP Java Kütüphanesi üzerine REST Docker Sunucu
punkt-segmenter
Ruby port of the NLTK Punkt sentence segmentation algorithm
sentence-autosegmentation
Deep-learning based sentence auto-segmentation from unstructured text w/o punctuation
sengiri
Yet another sentence-level tokenizer for the Japanese text
sentences
A command-line utility that splits natural language text into sentences.
TrTokenizer
🧩 A simple sentence tokenizer.