🐍💯 pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection library that works out of the box.

Results: 33 pySBD issues

I have installed the current spaCy version (3.1) and am running the example with some modifications, but it keeps throwing the error `ValueError: [E030] Sentence boundaries unset. You can add the...

Hello! In using `pySBD`, I've noticed that the current example script no longer works with the latest version of SpaCy (`3.3.0`). This is the traceback I get: ``` Traceback (most...

PySBD component using Language.factory

enhancement

**Describe the bug** Arabic sentence split on the Arabic comma. **To Reproduce** Steps to reproduce the behavior: ``` import pysbd text = "هذه تجربة، للغة العربية" seg = pysbd.Segmenter(language="ar", clean=True)...

Hey there, I've been using pysbd to detect sentence boundaries in Hindi and Marathi and then save the same data rearranged from a paragraph to one sentence boundary per...

Thanks for a great sentence splitting package. Here is a small contribution after troubleshooting why the code was not working out of the box. spaCy v3 requires a string in the...

e.g. Just curious about those special cases: are abbreviations handled too? That is, writing "i.e" and not "i.e.", or "e.g" and not "e.g."

Update the example to work with the latest spacy as installed by `pip install spacy` (version 3.0.6), and fix failure to segment sentences due to `doc.char_span` returning None. Fixes #96....