pragmatic_segmenter icon indicating copy to clipboard operation
pragmatic_segmenter copied to clipboard

Quotation mark at the beginning of a sentence breaks segmentation

Open arp opened this issue 2 years ago • 0 comments

Example:

 text = '"These should be two different sentences. Of course."'
 s = PragmaticSegmenter::Segmenter.new(text: text)
 s.segment
 
 # RETURNS:
 ["\"These should be two different sentences. Of course.\""]
 
 # SHOULD RETURN:
 ["\"These should be two different sentences.",  "Of course.\""]

arp avatar Dec 18 '23 07:12 arp