RapidFuzz issues

Add BK Tree implementation

2

It would make sense to add a BK Tree implementation for `scorers` which full fill the triangle inequality. This would provide massive performance improvements for things like searches. https://dl.acm.org/doi/10.1145/362003.362025

maxbachmann

enhancement

performance

add SIMD support to more functions in the process module

SIMD support is still missing for: - [ ] process.extractOne - [ ] process.extract - [ ] process.cdist when both sequences are similar. - [ ] process.extract_iter

maxbachmann

performance

Different results on Windows and Linux? Linux didn't supported?

4

I run same code on pycharm and Linux, but I get different results, python: from rapidfuzz import fuzz score= fuzz.token_set_ratio("It is an apple", "It is an apple juice") print(score) In...

OAE69

question

Adds examples to token_set_ratio

I've added two examples to the `fuzz.token_set_ratio` docs: One showing if one string is a subset of the other, it will return 100.0 and the other showing a divergence from...

jlb52

AttributeError: module 'rapidfuzz.process' has no attribute 'cpdist'

1

I got the latest version of the library, but I get the following error: AttributeError Traceback (most recent call last) Cell In [55], line 3 1 from rapidfuzz import process...

Alireza-G-Qre

1 test fails

5

``` ========================================================================================= FAILURES ========================================================================================== _________________________________________________________________________________ test_large_prefix_weight __________________________________________________________________________________ def test_large_prefix_weight(): > assert pytest.approx(JaroWinkler.similarity('milyarder', 'milyarderlik',prefix_weight=0.5)) == 1.0 tests/distance/test_JaroWinkler.py:13: _ _ _ _ _ _ _ _ _ _ _ _ _ _...

yurivict

bug

Quick Question

4

Where can I see the implementation of .partial_ratio() ? Can you let me know the logic which is utilized for this method. Thanks in advance!

kd10041

documentation

question

"import rapidfuzz" causes "illegal hardware instruction"

20

While I had used rapidfuzz previously without issues in my Python scripts, now every script that uses it crashes with an error such as `illegal hardware instruction` (Console) or `Terminated...

workflowsguy

bug

ci: try macos-12

Trying macos-12.

henryiii

Could partial_ratio support Levenshtein.normalized_similarity?

2

rapidfuzz 3.9.6 and python 3.10. I have carefully read the rapidfuzz docs and made tests. I find: partial_ratio use Indel.normalized_similarity. Could partial_ratio also support Levenshtein.normalized_similarity? Maybe default to use Indel.normalized_similarity,...

rocke2020

enhancement

RapidFuzz
RapidFuzz copied to clipboard

Metadata

Add BK Tree implementation

add SIMD support to more functions in the process module

Different results on Windows and Linux? Linux didn't supported?

Adds examples to token_set_ratio

AttributeError: module 'rapidfuzz.process' has no attribute 'cpdist'

1 test fails

Quick Question

"import rapidfuzz" causes "illegal hardware instruction"

ci: try macos-12

Could partial_ratio support Levenshtein.normalized_similarity?

← Metadata

Owner

Metadata

RapidFuzz RapidFuzz copied to clipboard

Metadata

← Metadata

Owner

Metadata

RapidFuzz
RapidFuzz copied to clipboard