spaczz icon indicating copy to clipboard operation
spaczz copied to clipboard

Fuzzy matching and more functionality for spaCy.

Results 18 spaczz issues
Sort by recently updated
recently updated
newest added

Thanks a lot for your fabulous package; it is really helpful. However, when I tried to reproduce your results, I run into this error: "UserWarning: [W036] The component 'matcher' does...

Hi, A very useful feature would be to have the original pattern matched by SpaczzRuler, because when similar patterns are added, there may be doubts about which one is the...

I am able to get this [regex](https://pythex.org/?regex=(%3Fi)proficiency%20in%20(%5B%5Cw%5Cs%5D%2B)%20tools%20like%20(.*%24)&test_string=Hello%20how%20are%20you%3F%20Proficiency%20in%20ETL%20tools%20like%20Informatica%2C%20Talend%2C%20Alteryx%20and%20Visualization%20tools%20like%20PowerBi%2C%20Tableau%20and%20Qlikview.&ignorecase=0&multiline=0&dotall=0&verbose=0) working using below code. ``` import spacy from spaczz.matcher import RegexMatcher nlp = spacy.blank("en") text = "Hello how are you? Proficiency in ETL...

Need to develop a way to return match quality details (fuzzy ratios and fuzzy regex counts) from `TokenMatcher` matches. I currently only do the fuzzy matching token patterns in spaczz...

enhancement

In order to sort `SpaczzRuler` matches across the different matchers by quality, I need a method or methods for comparing fuzzy ratios (ints between 0 and 100) and fuzzy regex...

enhancement

First of all, really appreciate your work and time. With small input data patterns, it is doing a good job, but when input data patterns crossing more than 1 lakh,...

help wanted
performance

Extend API to allow for adding/removing user-defined predefined regexes and fuzzy matchers.

enhancement

Add pipeline component to "clean" entities after setting (primarily intended for spaczz entities). I.e. if punctuation is included at the start/end of a fuzzy matched entity the span can be...

enhancement

Build out Read the Docs .rst documentation for comprehensive details.

documentation

``` NAME Grint Anderson 86 GPE Nashv1le 82 ``` @gandersen101 For both 'Grint' and 'Nashv1le' are from text, and their corresponding dictionary items are "Grant Andersen" and "Nashville". Is there...