string_distance_metrics icon indicating copy to clipboard operation
string_distance_metrics copied to clipboard

A set of Python string distance metrics for string distance comparisons

A set of simple string distance metrics including:

  • Levenshtein edit distance (using http://pypi.python.org/pypi/python-Levenshtein/) (0+)
  • Jaro Winkler, Jaro, Ratio distances (0+)
  • Title string lengths (0+)
  • Uni/bi/trigram distances (0.0-1.0)
  • Cosine distance (0.0-1.0)

See test_string_distance_measures.py for usage examples. Each method takes two strings and returns a distance >= 0, some are bounded by 1.0, others aren't.