pySBD issues

node.js port ?

3

Hello @nipunsadvilkar, thank you for your efforts to port the Ruby library to Python. Do you see any benefit in porting it to JavaScript (node.js) as well? And I wonder...

chopinml

make spaCy requirement more explicit

**Describe the bug** The requirement for spaCy 2.1.8 should be made more explicit (e.g., in a new [requirements.txt](https://github.com/nipunsadvilkar/pySBD/blob/master/requirements.txt)). Currently, it is listed only in the benchmarking requirements (e.g., [requirements-benchmark.txt](https://github.com/nipunsadvilkar/pySBD/blob/master/requirements-benchmark.txt)). **To Reproduce** Steps...

thomas-onesourceregulatory

Exception when clean=True in search_for_connected_sentences

2

**Describe the bug** Segmenter will raise "exception: bad escape (end of pattern) at position" when it is initialized with clean=True and it encounters a sentence like "etc.Png,Jpg,.\\" (word/token that contains...

balazik

matthen

bug

edge-cases
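
The error text quoted in this report matches what Python's standard `re` module raises when a pattern ends in a lone backslash. The sketch below is not pySBD's code. It only assumes that the failure comes from a raw token being used as a regular expression, and it shows that `re.escape` avoids the error. The `token` value and the `regex_error` helper are hypothetical, chosen for illustration.

```python
import re

# Hypothetical stand-in for the token in the report: it ends in one literal backslash.
token = "etc.Png,Jpg,.\\"


def regex_error(pattern):
    """Return the error message if `pattern` is not a valid regex, else None."""
    try:
        re.compile(pattern)
    except re.error as exc:
        return str(exc)
    return None


# Compiling the raw token fails because the trailing backslash escapes nothing.
raw_error = regex_error(token)

# Escaping the token first turns every special character into a literal.
escaped_pattern = re.compile(re.escape(token))
matched = escaped_pattern.search("see etc.Png,Jpg,.\\ for details") is not None

print(raw_error)
print(matched)
```

If the assumption holds, escaping user text before building a pattern from it would be one way to avoid the exception. Whether that fits the library's cleaning rules is for the maintainers to judge.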

How is accuracy on OPUS-100 computed?

1

Hi! Thanks for this library. Since there is no notion of documents in the OPUS-100 dataset it is not clear to me how accuracy is computed. I tried a naive...

bminixhofer

pySBD

node.js port ?

make spaCy requirement more explicit

Exception when clean=True in search_for_connected_sentences

Chinese segmenter's unexpected output

PyBSD vs PolyGlot

Spacy integration example is broken

Trim sentences

Catastrophic backtracking in HTMLTagRule

Infinite loop?

How is accuracy on OPUS-100 computed?
