Grant Andersen

Results: 14 comments by Grant Andersen

Will only support adding user-predefined regex patterns. This follows the current long-term design, which will eventually see the fuzzy search process implemented in Cython.

Hi @karrtikiyer. Spaczz's regex matching essentially just extends the spaCy docs' own recommendations for [applying regex to a full text](https://spacy.io/usage/rule-based-matching#regex-text). You can see the same ideas implemented in spaczz's...
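
To illustrate the idea in the linked spaCy docs (run a regex over the full text, then keep only matches whose character offsets line up with token boundaries), here is a minimal standard-library sketch. It is not spaczz's or spaCy's code: the whitespace tokenizer and the helper names `tokenize_with_offsets` and `regex_token_spans` are hypothetical stand-ins. In spaCy itself, the offset-to-token mapping is done with `doc.char_span(start, end)`, which returns `None` when the offsets do not align with token boundaries.

```python
import re


def tokenize_with_offsets(text):
    """Hypothetical whitespace tokenizer: (token, start_char, end_char) triples."""
    return [(m.group(), m.start(), m.end()) for m in re.finditer(r"\S+", text)]


def regex_token_spans(text, pattern):
    """Return (start_token, end_token, matched_text) for token-aligned regex matches.

    end_token is exclusive. Matches that start or end inside a token are
    skipped, mirroring how a misaligned character span is rejected.
    """
    tokens = tokenize_with_offsets(text)
    # Map character offsets to token indices for fast alignment checks.
    starts = {start: i for i, (_, start, _) in enumerate(tokens)}
    ends = {end: i for i, (_, _, end) in enumerate(tokens)}

    spans = []
    for match in re.finditer(pattern, text):
        start_tok = starts.get(match.start())
        end_tok = ends.get(match.end())
        if start_tok is not None and end_tok is not None:
            spans.append((start_tok, end_tok + 1, match.group()))
    return spans


text = "Call 555-1234 or 555-9876 today"
print(regex_token_spans(text, r"\d{3}-\d{4}"))
# [(1, 2, '555-1234'), (3, 4, '555-9876')]
print(regex_token_spans(text, r"555"))
# [] because each match ends in the middle of a token
```

The second call shows why alignment matters: a pattern that matches only part of a token has no corresponding token span, so it is dropped rather than silently widened.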

Hi @kormilitzin, thanks for your interest in spaczz! The warning you are seeing normally happens because of the way I have implemented the `TokenMatcher` (which is part of the `SpaczzRuler`)...

Hi @Lostincodes, I've been in a busy stretch at work, so thanks for your patience. This is currently not a part of spaczz because it isn't part of spaCy...

Hi @wTaylorBickelmann. I believe that when you have the `en_core_web_sm` model in the pipeline and then add the `SpaczzRuler`, the ruler is being added to the end...

Hi @sasi143, thanks for your interest in spaczz. I am very interested in improving the speed of the fuzzy matching process. However, for reasons I'll outline below, I unfortunately do...

Issue #41 has turned into a performance discussion and I am planning to make some performance improvements very soon. I will provide a summary of those changes on this thread...

I keep going back and forth on this, but have decided that this will not be a pipeline component. Instead, it will be inherited by the fuzzy searcher and regex searcher.

@kwhumphreys very cool. Best of luck! Obviously I have not put much time into `spaczz` over the past couple years but the functionality is something people have been looking for....

Closed by #81. `spaczz` v0.6 now returns original patterns.