fuzzywuzzy-rs icon indicating copy to clipboard operation
fuzzywuzzy-rs copied to clipboard

port of https://github.com/seatgeek/fuzzywuzzy

Results 13 fuzzywuzzy-rs issues
Sort by recently updated
recently updated
newest added

essentially all of the algorithms in this crate are poorly suited to unicode because they iterate over the `char`s in the string instead of the graheme clusters. https://crates.io/crates/unicode-segmentation is the...

1.0.0
conformance

Add an explicit type definition `Match` for matches returned by `process` methods (containing matched text and a score). This is helpful when adding behavior involving sorting or comparison, and provides...

1.0.0

See the [original implementation](https://github.com/seatgeek/fuzzywuzzy/blob/master/fuzzywuzzy/process.py#L85) where the processor function is only executed once if it's a known method. I'm not sure if this is an optimization to prevent duplicate calls, or...

bug
performance
conformance

See https://github.com/maxbachmann/rapidfuzz/issues/13 for generous details from another fuzzywuzzy compatible project author. Essentially, `partial_ratio` attempts to align strings optimally, then take the ratio of the aligned string subsets. The method of...

enhancement
performance

Based on #26 ([hash](https://github.com/logannc/fuzzywuzzy-rs/pull/26/commits/cf2765db891fe4c13536c5525832166a02ff4ea9#diff-5665c6a1451a77c999912ea3811525be2d79e593452d44ae442c1d0422de52b0R40)) > I'd like to convert this function to use MatchingStreak's internally. > It might make it more clear to be comparing low1 < streak.idx1 instead of...

bug
enhancement

The wisdom for a while has been "anyhow" for binaries, "thiserror" for libraries. The macros from thiserror are simply and non-magic enough that I feel comfortable using them normally. _Originally...

enhancement

The only standard library uses we currently have are `HashSet`, for which there are [excellent replacements](https://crates.io/crates/hashbrown/0.1.0), and `std::cmp::{min, max}` which are obviously functions we can live without.

enhancement

Opened based on the discussion in #23 and in [this comment](https://github.com/logannc/fuzzywuzzy-rs/pull/23#issuecomment-734502364). > "Well, I'm dissatisfied with the options available for handling unicode. In the same way we allow alternative scorers,...

enhancement

fixes #24 fixes #22 fixes #20 fixes #7 (mostly - once you have extractWithoutOrder, the rest are basically just 'get top N' which can be done by callers) #6 might...

wip

Missing the following methods from [the original process implementation](https://github.com/seatgeek/fuzzywuzzy/blob/master/fuzzywuzzy/process.py): - `extract` - `extractBests` - `dedupe`

1.0.0
conformance