fuzzywuzzy icon indicating copy to clipboard operation
fuzzywuzzy copied to clipboard

Fuzzy String Matching in Python

Results 100 fuzzywuzzy issues
Sort by recently updated
recently updated
newest added

takes 30 seconds to process 1.1 million product names the npm library fuzzysort is much faster currently

I am trying to wrap my head around how it calculates the Levenshtein Distance between two strings, as the docs clearly mention that it is using that. The Levenshtein Distance...

I am trying to understand the score in fuzzy-wuzzy calculated. so for now I know it depends on SequenceMatcher from difflib package. and as shown in difflib document the score...

Using the process.extractOne and fuzz.ratio give different results in this case: ``` from fuzzywuzzy import fuzz from fuzzywuzzy import process stringToMatch = 'Florinia-SP' possibleResults = ['São Bernado do Campo-SP', 'Florínea-SP']...

A small (and perhaps pedantic) README change is required. fuzzyrusty has been renamed to fuzzywuzzy-rs as per: https://github.com/logannc/fuzzywuzzy-rs > NOTE: This project was originally named fuzzyrusty, but someone else cloned...

I am getting test failure in test_process_warning. It seems the warning is produced and captured by pytest instead: ``` test_fuzzywuzzy_pytest.py F [100%] =================================== FAILURES =================================== _____________________________ test_process_warning _____________________________ capsys =...

Hi, I am enabling ppc64le build on travis for this project. But its failing for one of the python version(pypy3.5-5.8.0). And i observed that travis build is also not generating...

I was looking for a `process` function which would return the best x matches above a certain score cutoff. After some amount of searching and scrolling through the source code,...

I apologize if this feature is already available, but I was wondering if there is currently a way to pass an argument that represents some `threshold` into one of the...

I am noticing that some of my matches in which I have one term as a subset of another term for partial_set_token come back with the non-optimal choice. For the...