Max Bachmann

Results 300 comments of Max Bachmann
trafficstars

This is a known issue in python-Levenshtein: https://github.com/seatgeek/fuzzywuzzy/issues/79 In your case for the comparision of ``` "prod" "random" ``` the following alignment is used: ``` "prod" "ndom" ``` which has...

This should be the same issue as https://github.com/seatgeek/thefuzz/issues/35 and should be fixed by uninstalling `python-Levenshtein`/`Levenshtein` and installing it again.

This should be solved by running ``` pip uninstall Levenshtein python-Levenshtein pip install python-Levenshtein ``` since this is a bug in pip in regards to updating. So it would be...

I do not know any specific reason for this design decision. I personally decided to go with floating point in my own implementation of the algorithms ([RapidFuzz](https://github.com/maxbachmann/RapidFuzz)), since I could...

When creating a new project you should probably just use: https://github.com/maxbachmann/RapidFuzz which is a MIT licensed and faster re-implementation of fuzzywuzzy

Given it is GPL licensed you could run into license issues. Since there is a more open licensed version you can just save yourself this hassle. If you still really...

There are a couple of important differences between the two packages: 1) In FuzzyWuzzy limit specifies how many elements you want `extract` to return. `extract` does not provide an argument...

As a I described here: https://github.com/seatgeek/fuzzywuzzy/issues/279 this is most likely caused by the automatic junk heuristic of difflib which is not deactivated by fuzzywuzzy

Hm my personal approach towards this problem would be to use something like the following: ```python def disguised_letter_fix(s): replacements = { 'vv': 'w', 'nn': 'm' } pattern = re.compile(r'(' +...

These are the docs of process.extract: > Select the best match in a list or dictionary of choices. Find best matches in a list or dictionary of choices, return a...