PolyFuzz icon indicating copy to clipboard operation
PolyFuzz copied to clipboard

results of get_matches() are not sorted by similarity score for all the values

Open ashutosh486 opened this issue 1 year ago • 3 comments

Hi,

I was running polyfuzz tfidf model to get the matches but few rows of the result was not sorted as per the top_n similarity score.

tfidf_model = PolyFuzz(tfidf_matcher)
tfidf_model.match(from_list, to_list)
tfidf_model.get_matches()

eg:

From To Similarity To_2 Similarity_2 To_3 Similarity_3 To_4 Similarity_4 To_5 Similarity_5
21 3 IN 1 LAVENDER & CAMOMILE 2 IN 1 LAVENDER & CAMOMILE 0.938 3 IN 1 LAVENDER & CAMOMILE 1 3 IN 1 LAVENDER 0.771 3 IN 1 LAVENDER & CHAMOMILE 0.831 LAVENDER CAMOMILE 0.764

ashutosh486 avatar Oct 28 '22 12:10 ashutosh486