Grant Andersen
Grant Andersen
Will only support adding user predefined regex patterns. This follows the current long-term design that will eventually see the fuzzy search process implemented in Cython.
Hi @karrtikiyer. Spaczz's regex matching essentially just extends on the spaCy's docs own recommendations for [applying regex to a full text](https://spacy.io/usage/rule-based-matching#regex-text). You can see the same ideas implemented in spaczz's...
Hi @kormilitzin, thanks for your interest in spaczz! The warning you are seeing normally happens because of the way I have implemented the `TokenMatcher` (which is part of the `SpaczzRuler`)...
Hi @Lostincodes, I've been in a busy stretch at work so thanks for your patience. So this currently is not a part of spaczz because it isn't part of spaCy...
Hi @wTaylorBickelmann. I believe what is happening when you have the `en_core_web_sm` model in the pipeline then add the `SpaczzRuler`, is that the ruler is being added to the end...
Hi @sasi143, thanks for your interest in Spaczz. I am very interested in improving the speed of the fuzzy matching process, however, for reasons I'll outline below, I unfortunately do...
Issue #41 has turned into a performance discussion and I am planning to make some performance improvements very soon. I will provide a summary of those changes on this thread...
Keep going back and forth on this, but decided this will not be a pipeline component but will be inherited by the fuzzy searcher and regex searcher.
@kwhumphreys very cool. Best of luck! Obviously I have not put much time into `spaczz` over the past couple years but the functionality is something people have been looking for....
Closed by #81. `spaczz` v0.6 now returns original patterns.