spaczz
spaczz copied to clipboard
Fuzzy matching and more functionality for spaCy.
Thanks a lot for your fabulous package; it is really helpful. However, when I tried to reproduce your results, I run into this error: "UserWarning: [W036] The component 'matcher' does...
Hi, A very useful feature would be to have the original pattern matched by SpaczzRuler, because when similar patterns are added, there may be doubts about which one is the...
I am able to get this [regex](https://pythex.org/?regex=(%3Fi)proficiency%20in%20(%5B%5Cw%5Cs%5D%2B)%20tools%20like%20(.*%24)&test_string=Hello%20how%20are%20you%3F%20Proficiency%20in%20ETL%20tools%20like%20Informatica%2C%20Talend%2C%20Alteryx%20and%20Visualization%20tools%20like%20PowerBi%2C%20Tableau%20and%20Qlikview.&ignorecase=0&multiline=0&dotall=0&verbose=0) working using below code. ``` import spacy from spaczz.matcher import RegexMatcher nlp = spacy.blank("en") text = "Hello how are you? Proficiency in ETL...
Need to develop a way to return match quality details (fuzzy ratios and fuzzy regex counts) from `TokenMatcher` matches. I currently only do the fuzzy matching token patterns in spaczz...
In order to sort `SpaczzRuler` matches across the different matchers by quality, I need a method or methods for comparing fuzzy ratios (ints between 0 and 100) and fuzzy regex...
First of all, really appreciate your work and time. With small input data patterns, it is doing a good job, but when input data patterns crossing more than 1 lakh,...
Extend API to allow for adding/removing user-defined predefined regexes and fuzzy matchers.
Add pipeline component to "clean" entities after setting (primarily intended for spaczz entities). I.e. if punctuation is included at the start/end of a fuzzy matched entity the span can be...
Build out Read the Docs .rst documentation for comprehensive details.
``` NAME Grint Anderson 86 GPE Nashv1le 82 ``` @gandersen101 For both 'Grint' and 'Nashv1le' are from text, and their corresponding dictionary items are "Grant Andersen" and "Nashville". Is there...