fuzzywuzzy
fuzzywuzzy copied to clipboard
Fuzzy String Matching in Python
takes 30 seconds to process 1.1 million product names the npm library fuzzysort is much faster currently
I am trying to wrap my head around how it calculates the Levenshtein Distance between two strings, as the docs clearly mention that it is using that. The Levenshtein Distance...
I am trying to understand the score in fuzzy-wuzzy calculated. so for now I know it depends on SequenceMatcher from difflib package. and as shown in difflib document the score...
Using the process.extractOne and fuzz.ratio give different results in this case: ``` from fuzzywuzzy import fuzz from fuzzywuzzy import process stringToMatch = 'Florinia-SP' possibleResults = ['São Bernado do Campo-SP', 'Florínea-SP']...
A small (and perhaps pedantic) README change is required. fuzzyrusty has been renamed to fuzzywuzzy-rs as per: https://github.com/logannc/fuzzywuzzy-rs > NOTE: This project was originally named fuzzyrusty, but someone else cloned...
I am getting test failure in test_process_warning. It seems the warning is produced and captured by pytest instead: ``` test_fuzzywuzzy_pytest.py F [100%] =================================== FAILURES =================================== _____________________________ test_process_warning _____________________________ capsys =...
Hi, I am enabling ppc64le build on travis for this project. But its failing for one of the python version(pypy3.5-5.8.0). And i observed that travis build is also not generating...
I was looking for a `process` function which would return the best x matches above a certain score cutoff. After some amount of searching and scrolling through the source code,...
I apologize if this feature is already available, but I was wondering if there is currently a way to pass an argument that represents some `threshold` into one of the...
I am noticing that some of my matches in which I have one term as a subset of another term for partial_set_token come back with the non-optimal choice. For the...