devikasondhi

Results 7 issues of devikasondhi

Hello, This is a peculiar case of enumeration where the tokenization gives inappropriate results. ```en.tokenize('See Section 3.) Or Section 2.)')``` results in ```['See Section 3 .', ') Or Section 2...

Hello, Here are some scenarios where ```is_url``` shows unexpected behaviour: 1) ```http://google.abcdefghi``` is valid but ```is_url``` states ```False``` in its test suite- constraining TLD to length 6 but TLD length...

Hello, I'm listing some scenarios where the ```is_email``` fails: 1) domain with localhost not accepted by is_email: ```email@localhost```, ```email@[127.0.0.1]``` are valid while the function returns False 2) unicode not handled-...

Hello, The current is_ip and is_ip_v6 functions do not validate an ipv4 mapped ipv6 address. For instance, ```is_ip("7::128.128.0.127")``` returns ```False``` while it should return ```True```.

Hello, Referring to the text sample ```We will meet at eight o'clock on Thursday morning.``` in POS.ipynb notebook, shouldn't the token ```o'clock``` be tagged as an adverb instead of a...

Hello, The PoS tagger does not seem to take into account the presence of possesive ending ('s) >>> blob = "John's big idea isn't all that bad." >>> text =...

As per the specifications (https://www.w3.org/TR/REC-xml/#charsets), control characters under 0x20 (except #x9, #xA, #xD) should be omitted in XML documents. However, XmlEscapers.xmlContentEscaper() does not account for this. Steps to reproduce: ```...

type=api-docs
status=triaged
package=escape
P3