RapidFuzz issues

Add support for Smith Waterman algorithm

4

The [Smith Waterman algorithm](https://en.wikipedia.org/wiki/Smith%E2%80%93Waterman_algorithm) is a commonly used metric to compare strings. It would be useful to add it to RapidFuzz.

maxbachmann

enhancement

Could you please provide compatibility with Cython 0.29.x?

11

I've been trying to package RapidFuzz for Gentoo. Unfortunately, we're nowhere near close to being ready to switch to Cython 3.x, so the requirement on alpha version of Cython makes...

mgorny

enhancement

partial_ratio breaks when short string is tool long

1

Hi, Thanks for writing rapidfuzz - it's been really helpful for me. I noticed some unexpected behaviour in the partial_ratio function. In my case, when the length of the shorter...

rfara

bug

Semipartial ratio

I’d love to see ratio/distance functions for Levenshtein matching bound to either the beginning or the end of the longer string. (In the sense that partial_ratio is unbound in both...

M0rtenB

enhancement

fuzz.token_set_ratio, a possible bug, at least not the same as doc

8

Dear Max Bachmann, today, I nice to find your package, yes better than fuzzywuzzy. thanks! I find a possible bug on fuzz.token_set_ratio which claims "Compares the words in the strings...

rocke-dong

question

add function to return all alignments

Currently there is only a functions editops/opcodes, which returns one possible optimal alignment. However there can be more than one optimal alignment. It would make sense to add the possibility...

maxbachmann

enhancement

Add support for Damerau Levenshtein distance

1

The Damerau Levenshtein distance is a a commonly used metric to compare strings. Support for this could be added based on https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.142.1245&rep=rep1&type=pdf.

maxbachmann

enhancement

Implement banded version of InDel distance

A banded version of the Levenshtein distance algorithm should be implemented as described in http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.59.6975&rep=rep1&type=pdf. This would reduce the runtime of scorers like `fuzz.ratio` from `O(N/64 * M)` to `O(score_cutoff/64...

maxbachmann

performance

Add Build to Cygwin

It would be helpful to add RapidFuzz as a Cygwin package, so no compilation is required for Cygwin users: https://github.com/spotDL/spotify-downloader/issues/1306

maxbachmann

enhancement

help wanted

Improve ReadMe

The current readme does not reflect many of the most recent changes. It would make sense to update it accordingly: - [ ] From the current readme it appears like...

maxbachmann

documentation

help wanted

RapidFuzz
RapidFuzz copied to clipboard

Metadata

Add support for Smith Waterman algorithm

Could you please provide compatibility with Cython 0.29.x?

partial_ratio breaks when short string is tool long

Semipartial ratio

fuzz.token_set_ratio, a possible bug, at least not the same as doc

add function to return all alignments

Add support for Damerau Levenshtein distance

Implement banded version of InDel distance

Add Build to Cygwin

Improve ReadMe

← Metadata

Owner

Metadata

RapidFuzz RapidFuzz copied to clipboard

Metadata

← Metadata

Owner

Metadata

RapidFuzz
RapidFuzz copied to clipboard