punkt-segmenter
Ruby port of the NLTK Punkt sentence segmentation algorithm
Hello, I recently did some work [porting your code to Go](https://github.com/harrisj/punkt), and one of the things I added was code to load the preset language_parameters stored in pickle files within...
The expression `Punkt::SentenceTokenizer.new("08. 94 01. 95")` raises a `Math::DomainError` (`Math.log(-Infinity)`).
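The snippet below is a minimal sketch of how a `Math.log(-Infinity)` call can arise in plain Ruby, without loading the gem. It assumes, without confirmation from the report, that the argument is itself the logarithm of zero. `domain_error?` and `safe_log` are hypothetical helpers written for illustration; they are not part of punkt-segmenter, and the fallback value is an arbitrary choice.

```ruby
# Math.log(0.0) does not raise; it returns -Infinity.
zero_log = Math.log(0.0)

# Returns true when Math.log rejects the value with Math::DomainError.
# Hypothetical helper, for illustration only.
def domain_error?(value)
  Math.log(value)
  false
rescue Math::DomainError
  true
end

# Hypothetical guard: return a fallback instead of raising when the
# argument is not strictly positive. The fallback of 0.0 is an assumption.
def safe_log(value, fallback = 0.0)
  value > 0 ? Math.log(value) : fallback
end

puts zero_log                  # the result of Math.log(0.0)
puts domain_error?(zero_log)   # whether Math.log(-Infinity) raises
puts safe_log(zero_log)        # guarded call returns the fallback
```

Whether a guard like this is the right fix depends on where the non-positive value originates in the tokenizer, which the report does not say.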
Find an alternative to Unicode_utils for Ruby 1.8.x
Prerequisite for version 1.0.0: test coverage of at least all the algorithm parts ported from Python.